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ART SEARCH RECORD 



Patent Examiner Francis Moonan 
US Patent Application : 09/622978 
Inventor : Hull, Roger; Harper, Glyn 



EAST Search. 



Databases=USPAT; US-PGPUB; EPO; JPO; DERW 
Date= 8 November 2001 



Query=(Schenk.IN. or Sagi.IN. or Remy.IN. or Swennen.IN. or 

Dietzen.IN. or Geering.IN. or McMichael.IN. or thomas.IN or Grof.IN or 

Elliott.IN. or Hull.IN or Harper.IN or Lockhart.rN.)and (sugarcane or 

banana or plantain) 

Results=4 

Art Considered =^ 4 

Adams et al. 23 February 1999. 

Methods and compositions fro the production of stablytransformed fertile 
monocot plants and cells thereof. 
US Patent No. 5,874,265 

Olszewski et al. 25 February 1999. 
Sugarcane bacilliform virus promoter. 
WIPO No. wo 99/09190 

Olszewski et al. 30_Noyember 1999. 
Sugarcane bacilliform virus promoter. 
US Patent No. 5,994,123 

Olszewski et al. 25 July 2000. 
Sugarcane baciUiform virus promoter. 
US Patent No. 5,994,123 



Databases = AGRICULTURE 
Date=9 November 2001 



OuerY =Schenk or Sagi or Remy or Swennen or Dietzen or Geering or 
McMichael or Thomas or Grof or Elliott or Hull or Harper or Lockhart 
and sugarcane or banana or plantain not mammal? not enzym? and virus 
or viral and pararetrovirus or badnavirus 

Results ^ 146 

Art Considered= 14 



II. 



JUAtee/STN Search. 



L8 ANSWER 1 OF 146 CAPLUS COPYRIGHT 2001 ACS 

ACCESSION NUMBER: 2001:763212 CAPLUS 

TITLE: Eukaryotic recombinant bidirectional transcription 

vector to effect gene silencing via dsRNA-mediated 

inhibition of gene expression 
INVENTOR(S): Palmer, Kenneth E.; Pogue, Gregory P. 

PATENT ASSIGNEE(S): Large Scale Biology Corporation, USA 
SOURCE: PCT Int. AppL, 73 pp. 

CODEN: PIXXD2 
DOCUMENT TYPE: Patent 
LANGUAGE: English 
FAMILY ACC. NUM. COUNT: 1 
PATENT INFORMATION: 

PATENT NO. KIND DATE APPLICATION NO. DATE 



WO2001077350 A2 20011018 WO 2001 -US 11436 20010404 

W- AE, AG, AL, AM, AT, AU, AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, 
CO, CR, CU, CZ, DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, 
HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, LS, 
LT LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, NO, NZ, PL, PT, RO, 
RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG, US, UZ, 
VN YU, ZA, ZW, AM, AZ, BY, KG, KZ, MD, RU, TJ, TM 
RW: GH, GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW, AT, BE, CH, CY, 
DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE, TR, BF, 
B J CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG 
PRIORITY APPLN. INFO.: US 2000-545574 A 20000407 



LS ANSWER 14 OF 146 BIOSIS COPYRIGHT 2001 BIOSIS DUPLICATE 6 

ACCESSION NUMBER: 2000:216163 BIOSIS 

DOCUMENT NUMBER: PREV200000216163 

TITLE: The rice tungro bacilliform virus gene II product interacts 

with the coat protein domain of the viral gene III 

polyprotein. 

AUTHOR(S): Herzog, Etienne; Guerra-Peraza, Orlene; Hohn, Thomas (1) 
CORPORATE SOURCE: (1) Friedrich Miescher Institute, Maulbeerstrasse 66, 

CH-4058, Basel Switzerland 
SOURCE: Journal of Virology, (March, 2000) Vol. 74, No. 5, pp. 

2073-2083. 

ISSN: 0022-538X. 
DOCUMENT TYPE: Article 
LANGUAGE: English 
SUMMARY LANGUAGE: English 



L8 ANSWER 15 OF 146 AGRICOLA DUPLICATE 7 

ACCESSION NUMBER: 2001:28866 AGRICOLA 
DOCUMENT NUMBER: IND22298361 

TITLE: Characterization and genomic analysis of tobacco vein 

clearing virus, a plant pararetrovims that 
is transmitted vertically and related to sequences 
integrated in the host genome. 

AUTHOR(S): Lockhart, B.E.; Menke, J.; Dahal, G.; Olszewski, N.E. 
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AVAILABILITY: DNAL (QR360.A1 J6) 

SOURCE: The Journal of general virology, June 2000. Vol 81, 

No. pt.6. p. 1579-1585 

Publisher: Reading : Society for General Microbiology. 

CODEN: JGVIAY; ISSN: 0022-1317 
NOTE: Includes references 

PUB. COUNTRY: England; United Kingdom 

DOCUMENT TYPE: Article 

FILE SEGMENT: Non-U.S. Imprint other than FAO 
LANGUAGE: English 



L8 ANSWER 25 OF 146 AGRICOLA DUPLICATE 13 

ACCESSION NUMBER: 2000:46607 AGRICOLA 
DOCUMENT NUMBER: IND22046496 

TITLE: Integrated pararetro viral sequences define a unique 

class of dispersed repetitive DNA in plants. 
AUTHOR(S): Jakowitsch, J.; Mette, M.F.; Winden, J. van der.; 

Matzke, M.A.; Matzke, A.J.M. 
CORPORATE SOURCE: Institute of Molecular Biology, Salzburg, Austria. 
SOURCE: Proceedmgs of the National Academy of Sciences of the 

United States of America, Nov 9, 1999. Vol. 96, No. 

23. p. 13241-13246 

Publisher: Washington, D.C. : National Academy of 
Sciences, 

CODEN: PNASA6; ISSN: 0027-8424 
NOTE: Includes references 

PUB. COUNTRY: District of Columbia; United States 

DOCUMENT TYPE: Article; Conference 

FILE SEGMENT: U.S. Imprints not USD A, Experiment or Extension 

LANGUAGE: English 

L8 ANSWER 26 OF 146 CAPLUS COPYRIGHT 2001 ACS DUPLICATE 14 
ACCESSION NUMBER: 1999:523005 CAPLUS 
DOCUMENT NUMBER: 131 :267862 

TITLE: A short open reading frame terminating in front of a 

stable hairpin is the conserved feature in pregenomic 

RNA leaders of plant pararetro viruses 
AUTHOR(S): Pooggin, Mikhail M.; Futterer, Johannes; Skryabin, 

Konstantin G.; Hohn, Thomas 
CORPORATE SOURCE: Friedrich Miescher Institute, Basel, CH-4002, Switz. 
SOURCE: J. Gen. Virol. (1999), 80(8), 2217-2228 

CODEN: JGVIAY; ISSN: 0022-1317 
PUBLISHER: Society for General Microbiology 

DOCUMENT TYPE: Journal 
LANGUAGE: English 
REFERENCE COUNT: 54 

REFERENCE(S): (1) Bacharach, E; Joumal of Virology 1998, V72, P6944 

CAPLUS 

(2) Bao, Y; Virology 1993, V197, P445 CAPLUS 

(3) Bhattacharyya-Pakrasi, M; Plant Joumal 1993, V4, 
P71 CAPLUS 

(4) Bonneville, J; Cell 1989, V59, PI 135 CAPLUS 

(5) Bouhida, M; Joumal of General Virology 1993, V74, 
PI 5 CAPLUS 

ALL CITATIONS AVAILABLE IN THE RE FORMAT 



3 



L8 ANSWER 27 OF 146 Elsevier BIOBASE COPYRIGHT 2001 Elsevier Science B.V. 
DUPLICATE 

ACCESSION NUMBER: 1999132177 Elsevier BIOBASE 
TITLE: A promoter from sugarcane bacilliform 

badnavirus drives transgene expression in 

banana and other monocot and dicot plants 
AUTHOR: Schenk P.M.; Sagi L.; Remans T.; Dietzgen R.G.; 

Bernard M.J.; Graham M.W.; Manners J.M. 
CORPORATE SOURCE: P.M. Schenk, Coop. Res. Trop. Plant Pathol. Ctr., 

University of Queensland, JohnHines Building, St. 

Lucia, QLD 4072, AustraHa. 
^ SOURCE: Plant Molecular Biology, (1999), 39/6 (1221-1230), 39 

reference(s) 

CODEN: PMBIDB ISSN: 0167-4412 
DOCUMENT TYPE: Journal; Article 
COUNTRY: Netherlands 
LANGUAGE: English 
SUMMARY LANGUAGE: English 



L8 ANSWER 29 OF 146 AGRICOLA DUPLICATE 17 

ACCESSION NUMBER: 2000:15812 AGRICOLA 
DOCUMENT NUMBER: IND22025459 
TITLE: Structure and promoter/leader deletion analysis of 

mirabilis mosaic virus (MMV) full-length transcript 

promoter in transgenic plants. 
AUTHOR(S): Dey, N.; Maiti, LB. 

CORPORATE SOURCE: University of Kentucky, Lexington, KY. 
AVAILABILITY: DNAL (QK710.P62) 

SOURCE: Plant molecular biology, July 1999. Vol. 40, No. 5. p. 

771-782 

Publisher: Dordrecht : Kluwer Academic Publishers. 

CODEN:.PMBIDB; ISSN:_0167-4412 
NOTE: Includes references 

PUB. COUNTRY: Netherlands 
DOCUMENT TYPE: Article 

FILE SEGMENT: Non-U.S. Imprint other than FAO 
LANGUAGE: English 



L8 ANSWER 106 OF 146 AGRICOLA DUPLICATE 57 

ACCESSION NUMBER: 93:51829 AGRICOLA 
DOCUMENT NUMBER: IND9303 1 884 

TITLE: An analysis of the complete sequence of a sugarcane 

bacilliform virus genome infectious to banana and 
rice. 

AUTHOR(S): Bouhida, M.; Lockhart, B.E.L.; Olszewski, N.E. 

CORPORATE SOURCE: Institut Agronomique Hassan II, Agadir, Morocco 
AVAILABILITY: DNAL (QR360.A1 J6) 

SOURCE: The Journal of general virology, Jan 1993. Vol. 74, 

No. pt.l.p. 15-22 

Publisher: Reading : Society for General Microbiology. 

CODEN: JGVIAY; ISSN: 0022-1317 
NOTE: Includes references. 

DOCUMENT TYPE: Article 
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V 



FILE SEGMENT: Non-U.S, Imprint other than FAO 

LANGUAGE: English 



L8 ANSWER 134 OF 146 GENBANK.RTM. COPYRIGHT 2001 

LOCUS (LOC) : NTA4 14165 GenBank (R) 

GenBank ACC. NO. (GBN): AJ414165 

CAS REGISTRY NO. (RN): 361 143-48-8 

SEQUENCE LENGTH (SQL): 4806 

MOLECULE TYPE (CI): DNA; linear 

DIVISION CODE (CI): Plants, fungi, algae 

DATE (DATE): 3 Oct 2001 

DEFINITION (DEF): Nicotiana tabacum ORFl , 0RF2 and partial 0RF3, 

pararetro virus-like sequence, V2, 
KEYWORDS (ST): coat protein; movement protein; ORE 1 ; 0RF2; 0RF3 ; 

polyprotein 
SOURCE: common tobacco. 

ORGANISM (ORGN): Nicotiana tabacum 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; 

Tracheophyta; Spermatophyta; Magnoliophyta; 

eudicotyledons; core eudicots; Asteridae; euasterids I; 

Solanales; Solanaceae; Nicotiana 
NUCLEIC ACID COUNT (NA): 2445 a 542 c 699 g 1120t 
REFERENCE: 1 (bases 1 to 4806) 

AUTHOR (AU): Jakowitsch,!.; Mette,M.F.; van Der Winden,J.; 

Matzke,M.A.; Matzke,AJ. 
TITLE (TI): Integrated pararetro viral sequences define a unique 

class of dispersed repetitive DNA in plants 
JOURNAL (SO): Proc. Natl. Acad. Sci. U.S.A., 96 (23), 13241-13246 

(1999) 

OTHER SOURCE (OS): CA 132:103712 
REFERENCE: 2 (bases 1 to 4806) 
AUTHOR (AU): van der Winden,J. 
TITLE (TI): ___ _ Direct Submission 

JOURNAL (SO): Submitted (25-SEP-2001) van der Winden J., Plant 
Molecular Genetics, Institute of Molecular Biology, 
Bilhothstrasse 11, 5020 Salzburg, AUSTRIA 

FEATURES (FEAT): 

Feature Key Location Qualifier 
==^==.=^====+=================+-=========--== 

source 1 . .4806 /organism="Nicotiana tabacum" 

/db-xref="taxon:4097" 
mRNA 202.. 1998 /note="ORF 1 " 

exon 202. . 1 998 /note="ORF 1 " 

/number=^l 

CDS 202.. 1998 /note="ORFl" 

/codon-start==l 

/product="putative coat protein" 

/protein-id="CAC88783.1" 

/db-xref=="GI: 15963357" 

/translation-"MNKEEFALEEKTYENPEGLK 

ITIIFSNLGRRYKKIGNNLNLMLE 

KETVKLEDSLTAMVRITKENEEIDRKREIKEIQQ 

QAKEKIQQIEEVKNTKITELEKEL 

EMLKQMYANKLKEKEKRKEEEEELKLTNEIERFK 
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LQLEEVQESPSKISINEINDQSDK 

DTELGENYSETSETYTELIEKTEKIKINPEINTG 

DMNEDKPSISGIKNPKQLNPTYYR 

VSYDAYDRNKTLWDKRLNKKWAPRQITEQYNFLD 

LDCVADINKTIQLWIGYISKQLID 

NKITITETPGYIERTLIGTVKLWLQNLSSESLDT 

LRSNKKLDGTTTTTVTDILNKYEI 

AIRNEFSSMTTEVEEQNKEKITNRNLMTKLAICN 

MCYIDEYTCAFRDYYYKGTYSPDE 

SBCEIRKLYFTKLPEPFSSKIIKSWNEAGLADTLG 

VRIKFLQNWFIELCEKYKENMKME 

KIL\^LACCKSRIAPQFGCTDKYYKKEGKKKKF 

KSKYSKYKYRKPRRRYYVKNYKHK 

KPYRKKKKLTECTCYNCGKLGHLAKDCKLPKNPK 

KKQITEILIDNDKYTQVEYVDYEL 

SSEDSMYEISENEFSESEINKDIEESDEENYD" 

mRNA 1979..2932 /note-"ORF2'* 

CDS 1979..2932 /note="ORF2 truncated" 

/codon-start=l 

/product="putative movement 
protein" 

/protein-id="CAC88784.1" 

/db-xref="GI:15963358" 

/translation="MKKIMTEKDIQIIQQEEHQD 

EQSSEQKIIFDANIFEQIKGKELD 

LSIDKILEVPTIKNWFBCRQKEEYYVVSQREHIID 

CKYIKGKAQIPILNKRLLNKEIQD 

IKAKNPIKYVHLGGTEILTKACFREGIDTPIEIY 

LADDRIVQPIEKSIISAVRGNLIY 

KKLKSIVSANYSVAVDDKNIDKSLVLYWKMSGIE 

LAPGSKIFTARCKNLYVLTTKHKI 

TAKNKE^JKIKIENPFERIVSVIDNNDYSYTEIDM 

DEDLEIVKERLSTSKRINNEMPET 

SSRSTSRSTSKRINYTTPQKLIEQKIEEINPHHY 

YITGIMD" 

exon 1979..2932 /note="ORF2" 

/number=l 

exon 3256„>4806 /note="ORF3" 
/number=l 

mRNA 3256.>4806 /note="ORF3" 

CDS 3256..>4806 /note-"ORF3 aspartic proteinase, 

reverse transcriptase, 

ribonuclease H" 

/codon-start=l 

/product="putative polyprotein" 

/protein-id="CAC88785.1" 

/db-xref="GI: 15963359" 

/translation="MPKIYILSKIIVEGYYNRYY 

TPMVDTGAEANMCRHNCLPESKWE 

KLKNPIWTGFNNEGSMITYKARNIKIQIWDKIL 

TIEEIYSYEFTNKDILLGMPFLDK 

LYPHIITKTHWWFTTPCKQKLGAKRVNNKVRKTT 

PWIKGSEKITQKLENIIQSNHNIE 

IIIFSINKIKPLQDKLELLYNDNPLQGWEKHQTK 

IKIELIDENSIITQKPLKYNFNDL 

TEFKMHIKELLDNNYIQESNSKHTSPAFIVNKHS 
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EQKRGKSRMVIDYRNLNAKTKTYN 
YPIPNKILKIRQIQGYNYFSKFDYKSGFYHLKLE 
DESKKLTAFTVPQGFYEWNVLPFG 
YKNAPGRYQHFMDNYFNQLENCIIYIDDILLYSR 

TENEHIKLLEKFIHIVEISGISLS 

KKKAEVMKNQIEFLGIQMDKNGIKMQTHVVQKII 

NLNETLDTKKKLQSFLGLVNQVRE 

YIPKLAENLKPLQKKLKKDIEYHFDEKDKIHIQK 

IKNMCKKLPKLYFPDEKKQFTYIV 

ETDSSDRVD" 

SEQUENCE (SEQ): 

1 cattaatagt acaaacccgc aaaacagtga aacaatcaaa ataactaaga aatcgcagaa 
61 aattaggaag aaactaaaga aactatatat agaatatgaa tacttaagta tcacaaaaat 
121 aaacaaagct aggttgtcca aattaatcga tataatatct aaaacagaat ataaatatgt 
181 ttgttatcta aaggataaaa gatgaataaa gaagaattcg cattagaaga gaaaacatat 
241 gagaatccag aaggattaaa aataacaata atattttcta acttaggaag aagatataaa 
301 aaaataggaa ataacctaaa cttaatgtta gaaaaagaaa ctgtaaaact agaggatagt 
361 ttaaccgcca tggttagaat aacaaaagaa aacgaagaaa tagatagaaa acgagagatt 
421 aaagaaatac aacagcaagc taaagaaaaa atacaacaga tagaggaagt aaaaaacact 
481 aaaataacag aattagaaaa agaattagag atgctaaaac agatgtatgc aaataaacta 
541 aaagaaaagg aaaaacgtaa agaagaagaa gaagaactaa aactaacaaa tgagatagaa 
601 agattcaaat tacagttaga agaagtacag gagagtccat caaaaataag tataaacgaa 
661 ataaatgacc aaagtgataa agacacagaa ctaggagaaa actattcaga aacaagtgaa 
721 acatacacag aacttataga aaaaacagaa aagataaaaa taaacccaga aataaataca 
781 ggagatatga acgaagataa accaagcata tcaggaataa aaaatccaaa acaattaaac 
841 ccaacctatt acagagtaag ttatgatgca tatgacagaa acaaaacatt atgggataaa 
901 aggttaaata agaaatgggc accaagacag ataactgaac aatataattt tttagatcta 
961 gattgtgtag cagatataaa taaaacaata caattatgga taggatatat ctcaaaacaa 
1021 ctaatagata ataaaataac aataacggaa acaccaggat atatagaaag aacattaata 
1081 ggaactgtaa aattatggtt acaaaattta tcaagtgaaa gcttagacac attaaggagt 
1141 aataaaaaac ttgacggtac aactacaaca acagttacag atatattaaa taaatatgaa 
1201 atagcaataa gaaatgaatt tagtagtatg acaacagaag tagaagaaca aaataaagaa 
1261 aaaattacaa atagaaattt aatgacaaaa ttagcaatat gtaatatgtg ttatatagat 
1321 gaatatactt gtgcatttag agactattat tataaaggga catatagtcc agatgaaagt 
1381 aaagagataa gaaaattata ttttacaaaa ttaccagaac cctttagctc aaaaataata 
1441 aaaagttgga acgaagcagg actagcagat acattaggag taaggataaa atttctacaa 
1501 aactggttta tagaattatg tgaaaaatat aaagaaaaca tgaaaatgga aaaaatatta 
1561 gtaaaaaatt tggcatgttg caaaagtaga atagcacccc aatttggctg cacagataaa 
1621 tattacaaaa aagaaggaaa gaaaaagaaa tttaaatcaa aatattcaaa gtataaatat 
1681 aggaaaccaa gaagaagata ttatgtaaaa aattataaac ataaaaaacc atacaggaaa 
1741 aagaaaaaac taacagaatg tacttgctat aattgtggaa agctaggaca cttagccaaa 
1801 gattgtaaat taccaaaaaa cccaaagaag aaacaaatta ccgaaatatt gatagataat 
1861 gataaatata cacaagtaga atatgtagat tatgaattaa gcagcgaaga cagcatgtat 
1921 gagatatcag aaaacgagtt ctctgaaagt gaaataaaca aggatataga agagtcagat 
1981 gaagaaaatt atgactgaaa aagatataca aattatacaa caagaagaac accaagatga 
2041 acaatcttca gaacaaaaaa taatatttga cgctaatata tttgaacaaa taaaaggaaa 
2101 agaattggat ctaagtatag acaaaatatt agaagtacca acaataaaga attggtttaa 
2161 aagacaaaaa gaagaatact atgtagtaag ccaaagagaa catataatag attgtaaata 
2221 catcaaaggt aaagcacaaa taccaatttt aaataaaaga ctactaaata aagaaataca 
2281 agatataaaa gcaaaaaatc caataaaata tgtacactta ggaggaacgg agatcctaac 
2341 aaaagcatgt tttagggaag gaatagatac ccctatagaa atatacttgg cagatgatag 
2401 aattgtacaa cctatagaaa agagtataat aagtgctgta agaggtaact taatatacaa 
2461 aaaattaaaa tctatagtaa gtgctaacta ttcagtagca gtagatgata aaaatataga 
2521 taaatcatta gtattatact ggaaaatgtc tggaatagaa ttagcaccag gaagtaaaat 
2581 attcacagca agatgtaaaa atctatatgt cttaacaaca aaacataaga taacagctaa 



2641 aaataaaata aataaaataa aaatagaaaa tccattcgaa agaatagtat cagttataga 
2701 caacaatgat tacagttata cagaaataga catggatgaa gatttagaaa tagtaaaaga 
2761 aagattaagc acatcaaaac gaataaataa tgaaatgcca gaaacatcat caagaagtac 
2821 atcaagaagt acatcaaaaa gaataaatta taccactcca caaaaattaa tagaacaaaa 
2881 aatagaagaa ataaacccac atcattatta tataacagga ataatggact aaagaaaata 
2941 tttaatacta ataaatacag ggcaggaaga gaattatgtt ataagagaac taataccaga 
3001 acaagagata gtaacaatag aacaacaaaa ttcagaacta ccaaaagcgt taagaaaaaa 
3061 cgaagaaaca actgaaaaag aattaattat tggaggaata ccaatattaa taaattttaa 
3121 aatatatcaa ggagataaaa atattacact agggataaaa tggttagaaa aagtcaaacc 
3181 atataaatta gaagatagac aattaacaat aagttatgaa aataagaaaa taataataaa 
3241 aagaactttg atataatgcc aaagatatac atactttcca aaataatagt agaaggatat 
3301 tataatagat attatacacc tatggtggat acaggagcag aagctaatat gtgtagacat 
3361 aattgtttac cagaaagtaa atgggaaaag ctaaaaaacc ccatagtagt aacaggattt 
3421 aataatgaag gtagtatgat aacatataaa gcaagaaata taaaaataca aatatgggat 
3481 aaaatattaa ccatagaaga aatatatagt tacgaattca caaataaaga tatattatta 
3541 ggaatgccat ttttagataa attataccca catattataa caaaaacaca ttggtggttt 
3601 actaccccgt gtaaacaaaa attaggagca aaaagagtaa ataataaagt aagaaaaaca 
3661 acaccctgga ttaaaggaag tgaaaagatt acccaaaaat tagaaaatat aatacaaagt 
3721 aaccataata tagagataat cattttctca ataaataaga taaaaccact acaagataaa 
3781 ctagaattac tatataatga taatccactc caaggatggg aaaaacatca aacaaaaata 
3841 aagattgaac taatagatga aaatagcata ataacacaaa aacctttaaa atacaatttt 
3901 aatgatttaa cagaatttaa aatgcatata aaagaattat tagataataa ctacatacaa 
3961 gaaagtaata gtaaacatac tagcccagca tttatagtaa ataagcatag tgaacaaaaa 
4021 agaggaaaaa gccgtatggt tatagattat agaaacttaa atgcaaaaac aaaaacatat 
4081 aattatccga taccaaataa aatactaaaa attagacaaa tacaaggata taactatttt 
4141 agtaaatttg actataaatc aggattttac catttaaaac tagaagatga atctaaaaag 
4201 ttaacagcat tcacagtacc acaaggattt tacgaatgga acgtattacc ttttggatat 
4261 aaaaatgcac caggtaggta tcaacatttt atggataatt acttcaacca attagaaaat 
4321 tgtataatat atatagatga tatattgcta tattctagaa cagagaacga acatataaaa 
4381 ctactagaaa aattcataca cattgtagaa atatcaggaa taagtttaag taaaaagaaa 
4441 gcagaagtaa tgaaaaatca aatagaattt ttaggtatac aaatggataa aaacggaata 
4501 aaaatgcaaa cccatgtagt acaaaaaata attaacttga atgaaacact tgatacaaaa 
4561 aagaagttac aatcattttt aggattggtt aaccaagtaa gagaatatat tcctaaatta 
4621 gcagaaaact taaaaccatt acagaaaaaa ttaaaaaagg acatagaata tcattttgac 
4681 gaaaaagata aaatacatat acagaagata aaaaatatgt gtaaaaaatt accaaaacta 
4741 tattttccag atgaaaagaa acaatttaca tatattgtag aaactgattc tagtgatcga 
4801 gtcgac 

L8 ANSWER 135 OF 146 GENBANK.RTM. COPYRIGHT 2001 

LOCUS (LOG): NTA414164 GenBank(R) 

GenBank ACC. NO. (GBN): AJ414164 

CAS REGISTRY NO. (RN): 361143-47-7 

SEQUENCE LENGTH (SQL): 6060 

MOLECULE TYPE (CI): DNA; linear 

DIVISION CODE (CI): Plants, fungi, algae 

DATE (DATE): 3 Oct 2001 

DEFINITION (DEF): Nicotiana tabacum partial 0RF2, 0RF3 and 0RF4, 

pararetrovirus-like sequence, V3. 
KEYWORDS (ST): movement protein; 0RF2; 0RF3 ; 0RF4; polyprotein; 

translatio; transactivator/inclusion body protein 
SOURCE: common tobacco. 

ORGANISM (ORGN): Nicotiana tabacum 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; 

Tracheophyta; Spermatophyta; Magnoliophyta; 

eudicotyledons; core eudicots; Asteridae; euasterids I; 

Solanales; Solanaceae; Nicotiana 
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NUCLEIC ACID COUNT (NA): 2724 a 857 c 915 g 1559 t 5 others 
REFERENCE: 1 (bases 1 to 6060) 

AUTHOR (AU): JakowitschJ.; Mette,M.F.; van Der WindenJ.; 

Matzke,M.A.; Matzke,AJ. 
TITLE (TI): Integrated pararetro viral sequences define a unique 

class of dispersed repetitive DNA in plants 
JOURNAL (SO): Proc. Natl Acad, Sci. U.S.A., 96 (23), 13241-13246 
(1999) 

OTHER SOURCE (OS): CA 132:103712 
REFERENCE: 2 (bases 1 to 6060) 

AUTHOR (AU): van der Winden,J. 
TITLE (TI): Direct Submission 

JOURNAL (SO): Submitted (24-SEP-200 1) van der Winden J., Plant 
Molecular Genetics, Institute of Molecular Biology, 
Billrothstrasse 11, 5020 Salzburg, AUSTRIA 

FEATURES (FEAT): 
Feature Key Location Qualifier 

source 1 . . 6060 /organism="Nico tiana tabacum" 

/db-xref-"taxon:4097" 
exon <3..1157 /note="ORF2" 

/number=l 

mRNA <3..1157 /note="ORF2" 

CDS <3..1157 /note-"ORF2" 

/codon-start=l 

/product="putative movement 
protein" 

/protein-id="CAC88780.1" 

/db-xref="GI: 15963353" 

/translation="RLDLSIDKILEVPTIKNWFK 

RQKEEYYVVSQREHIIDCKYIKGK 

AQIPILNKRLLNKEIQDIKAKNPIKYVHLGGTEI 

LIKACFREGIDTPIEIYLADDRIV 

QPIEKSIISAVRGNLIYQKFKFIVSANYSVAVDD 

KNIDKSLVLYWKMSGIELAPGSKI 

FTARCKNLYVLTTKHKITAKNIONKIKIENPFER 

IVSVIDNNDYSYTEIDMDEDLEIV 

KERLSTSKRINNEMPETSSRSTSRSTSKRINYTT 

PQKLIEQKIEEINPHHYYITGIMD 

QRKYLILESfTGQEENYVIRELIPEQEIVTIEQQN 

SELPKALRKNEETTEKELIIGGIP 

ILINFKIYQGDKNITLGIKWLEKVKPYKLEDRQL 

TISYENKKIIIKRTLI" 

mRNA 1 156.. 25 84 /note="ORF3 truncated 0RF3" 

exon 1156..2584 /note="ORF3" 

/number=l 

CDS 1 157..2584 /note="ORF3 aspartic proteinase, 

reverse transcriptase, 
ribonuclease H truncated" 
/codon-start=l 

/product="putative polyprotein" 
/protein-id="CAC88781.1" 
/db-xref="GI: 15963354" 
/translation="MPKIYILSKIIVEGYYNRYY 
TPMVDTGAETNMCRHNCLPESKWE 
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KLKTPIVVTGFNNEGSMITYKARNIKIQIWDKIL 
TIEEIYSYEFTNKDILLGMPFLDK 
LYPHIITKTHWWFTTPCKQKLGAKRVNNKVRKTT 
PWDCGSEKITQKLENIIQSNHNIE 
IIIFSINKIKPLQDKLELLYNDNPLQGWEKHQTK 
IKIELIDENNIITQKPLKYNFNDL 
TEFKMHIKELLDNNYIQESNSKHTSPAFIVNKHS 
EQKRGKSRMVIDYRNLNAKTKTYN 
YPIPNKILKIRQIQGYNYFSKFDCKSGFYHLKLE 
DESKKLTAFTVPQGFYEWNVLPFG 
YKNAPGRYQHFMDNYFNQLENCIIYIDDILLYSR 
TENEHIKLLEKFIHIVEISGISLS 
KKKAEVMKNQIEFLGIQIDKNGIKMQTHVYQKII 
NLNETLDTKKKLQSFLGLVNQVRE 
YIPKLAENLKPLQKKLKKDIEYHFD" 
exon 302 1 ..4229 /note="ORF4" 

/number=l 

mRNA 3021. .4229 /note="ORF4" 

CDS 302 L. 4229 /note-"ORF4" 

/codon-stait=l 

/product="putative translation 
transactivator/inclusion body 
protein" 

/protein-id-"CAC88782.1" 
/db-xref^"GI:15963355" 
/translation="MLLQTTYQDKGTQTDESQES 
KDIFTILTTLSLQMESMGKRLQQL 
ESQQHDYKNAELSRSEDSKLPKVEGDVGKLQKTH 
NTVALYTAAGTSKQVSKKPHINVN 
LNTVFDKPFTSKKPREAIVIAPQTSTYANSLHHN 
KKVYNHITQTYIENIYKIQTFLNL 
NPRSTTTTDPTQDYVTQKLQGYNRLIAQPKTKAN 
LVKTCYNYGLLSTVYTHDGEEIIG 
IPELYKAFVTFKRITKGNLFFIKFYTTPAEILYD 
EIKPIIQVIKIGLTRDMIIPEEIE 
KQPEIQKMEIPSFYANKRIIGISTIIQELANNYL 
QGNAIWSYYSRDQLMIYANSKELR 
QGDMDEVQKWILSLLKPEMQPTTRALKKEFISNE 
LLTRYCKLVGHKYPDHICSKCNGD 
DNYVPEVQLE" 
repeat-unit ' 4282..4470 /rpt-type=TANDEM 

SEQUENCE (SEQ): 

1 gtcgactcga tctaagtata gacaaaatat tagaagtacc aacaataaag aattggttta 
61 aaagacaaaa agaagaatac tatgtagtaa gccaaagaga acatataata gattgtaaat 
121 acatcaaagg taaagcacaa ataccaattt taaataaaag actactaaat aaagaaatac 
181 aagatataaa agcaaaaaat ccaataaaat atgtacactt aggaggaacg gagatcctaa 
241 taaaagcatg ttttagggaa ggaatagata cccctataga aatatacttg gcagatgata 
301 ggattgtaca acctatagaa aagagtataa taagtgctgt aagaggtaac ttaatatacc 
361 aaaaatttaa atttatagta agtgctaact attcagtagc agtagatgat aaaaatatag 
421 ataaatcatt agtattatac tggaaaatgt ctggaataga attagcacca ggaagtaaaa 
481 tattcacagc aagatgtaaa aatctatatg tcttaacaac gaaacataag ataacagcta 
541 aaaataaaat aaataaaata aaaatagaaa atccattcga aagaatagta tcagttatag 
601 acaacaatga ttacagttat acagaaatag acatggatga agatttagaa atagtaaaag 
661 aaagattaag cacatcaaaa cgaataaata atgaaatgcc agaaacatca tcaagaagta 
721 catcaagaag tacatcaaaa agaataaatt ataccactcc acaaaaatta atagaacaaa 
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781 aaatagaaga aataaaccca catcattatt atataacagg aataatggac caaagaaaat 
841 atttaatact aataaataca gggcaggaag agaattatgt tataagagaa ctaataccag 
901 aacaagagat agtaacaata gaacaacaaa attcagaact accaaaagcg ttaagaaaaa 
961 acgaagaaac aactgaaaaa gaattaatta ttggaggaat accaatatta ataaatttta 
1021 aaatatatca aggagataaa aatattacac tagggataaa atggttagaa aaagtcaaac 
1081 catataaatt agaagatagg caattaacaa taagttatga aaataagaaa ataataataa 
1141 aaagaacttt gatataatgc caaagatata catactttcc aaaataatag tagaaggata 
1201 ttataataga tattatacac ctatggtgga tacaggagca gaaactaata tgtgtagaca 
1261 taattgttta ccagaaagta aatgggaaaa gctaaaaaca cccatagtag taacaggatt 
1321 taataatgaa ggaagtatga taacatataa agcaagaaat ataaaaatac aaatatggga 
1381 taaaatatta accatagaag aaatatatag ttacgaattc acaaataaag atatattatt 
1441 aggaatgcca tttttagata aattataccc acatattata acaaaaacac attggtggtt 
1501 tactaccccg tgtaaacaaa aattaggagc aaaaagagta aataataaag taagaaaaac 
1561 aacaccctgg attaaaggaa gtgaaaagat tacccaaaaa ttagaaaata taatacaaag 
1621 taaccataat atagagataa tcattttctc aataaataag ataaaaccac tacaagataa 
1681 actagaatta ctatataatg ataatccact ccaaggatgg gaaaaacatc aaacaaaaat 
1741 aaagattgaa ctaatagatg aaaataacat aataacacaa aaacctttaa aatacaattt 
1801 taatgattta acagaattta aaatgcatat aaaagaatta ttagataata actacataca 
1861 agaaagtaat agtaaacata ctagcccagc atttatagta aataagcata gtgaacaaaa 
1921 aagaggaaaa agccgtatgg ttatagatta tagaaactta aatgcaaaaa caaaaacata 
1981 taattatccg ataccaaata aaatactaaa aattagacaa atacaaggat ataactattt 
2041 tagtaaattt gactgtaaat caggatttta ccatttaaaa ctagaagatg aatctaaaaa 
2101 gttaacagca ttcacagtac cacaaggatt ttacgaatgg aacgtattac cttttggata 
2161 taaaaatgca ccaggtaggt atcaacattt tatggataat tacttcaacc aattagaaaa 
2221 ttgtataata tatatagatg atatattgct atattctaga acagagaacg aacatataaa 
2281 actactagaa aaattcatac acattgtaga aatatcagga ataagtttaa gtaaaaagaa 
2341 agcagaagta atgaaaaatc aaatagaatt tttaggtata caaatagata aaaacggaat 
2401 aaaaatgcaa acccatgtag tacaaaaaat aattaacttg aatgaaacac ttgatacaaa 
2461 aaagaagtta caatcatttt taggattggt taaccaagta agagaatata ttcctaaatt 
2521 agcagaaaac ttaaaaccat tacagaaaaa attaaaaaag gacatagaat atcattttga 
2581 ctaaaaagat aaaatacata tacagaagat aaaaaatatg tgtaaaaaat taccaaaact 
2641 atattttcca gatgaaaaga aacaatttac atatattgta gaaactgatt ctagtgatca 
2701 cagctacgga ggagttctaa aatataaata tgataatgaa aaaattgaac accattgtag 
2761 atattattcc-ggatcataca cggaaccaca attaaaatgg gaa_ataaata ggaaagaac 
2821 atttggatta tataaatgtt tgttagcatt tgagccatat attgtttata acaaatttat 
2881 agtaagaaca gataatacac aagtaaaatg gtggataact cggaaagtac aggattcagt 
2941 aactacaaag gaaataagga gacttgtatt aaatatacaa aattttacat ttacaattga 
3001 agtaatacga actgacaaga atgttattgc agactaccta tcaagacaaa ggtacccaaa 
3061 cagatgaaag ccaagaatca aaagacatat ttacaatcct tactactcta tccttacaga 
3121 tggagagtat gggaaaaaga ttacaacagc tagaaagtca gcagcatgac tataaaaatg 
3181 cggagctaag tcgatcggaa gactctaaac ttccaaaggt agaaggagac gttgggaaac 
3241 tccaaaaaac ccataacaca gttgctttat atacagctgc aggtacaagc aaacaagtta 
3301 gcaaaaagcc acatatcaat gtaaacctaa ataccgtatt tgataagcca tttacatcaa 
3361 aaaagccaag agaagcaata gttatagccc cacaaacttc aacctatgct aatagcctac 
3421 accacaacaa aaaggtatat aaccatatta ctcaaacata tattgagaat atatataaaa 
3481 tccagacatt tctgaaccta aaccctagat caacaactac tacagatcca acacaagatt 
3541 atgtaaccca aaaactacag ggatataata ggcttatagc ccagccaaaa acaaaagcaa 
3601 acttagtaaa aacatgttac aactacggac tactcagcac agtatatacc catgacggag 
3661 aagaaataat tggaatacca gagctataca aagcatttgt caccttcaag agaattacaa 
3721 aaggcaacct attcttcatc aaattctata caacaccagc ggaaatacta tatgatgaaa 
3781 taaaacccat aatacaggtg ataaaaatag gactaacacg agatatgata ataccggaag 
3841 agatagagaa acaaccggag atacagaaaa tggagatacc aagtttctat gccaataaaa 
3901 gaataattgg tatatcaact attattcaag aactagctaa caattatcta caaggaaatg 
3961 ctatctggag ctactattcc agagaccaat tgatgatata tgccaactcg aaggaactac 
4021 gacaaggaga tatggatgaa gtccagaaat ggattttatc attattaaag ccagaaatgc 
4081 aaccaactac cagagcattg aaaaaagaat ttatttcaaa cgagttatta acaagatact 
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4141 gcaaactagt aggacacaaa tatccagacc acatatgttc aaagtgcaac ggagatgata 
4201 actatgtacc agaagtccaa ctagaatgaa ggtatcaaca agaagacaaa agaagaagac 
4261 agctactgga aacatataat gtaaagtaaa tagtactagt cgcattcatg aacagtaaag 
4321 gtcgttcatg aatagtaaga gtcataaagt aaatagtata tgtcataatc atgaacagta 
4381 aaggtcgttc atgaatagta agagtcataa agtaaatagt atatgtcata atcatgaaca 
4441 gtaaaggtcg ttcatgaata gtaggagtca tttgtaaaca gtaagagtcg ttttaatttt 
4501 ctttatatag aatgtaaatc cgaggaagga ggacatcctc agacatcctc atcctcactc 
4561 tctctcttat cttctctcaa taaaatatct aattatatac aaacttcaga aagctatgga 
4621 gcattcaaac gagttacaaa accaggtaag tttattcata atcatgttta actaaactca 
4681 tatattcctt ataatctctt gaatatcata attgtttatc atgttatagt actgttataa 
4741 agaacatctt gagaaagttt gcctgtctaa taatgctgag agtagttaag cccagactat 
4801 gccatcctaa agtggaaacc agtaggcaag gaagccgttt aggggagtac acagagttga 
4861 ggcgctgtta gggtaaatga ttgaagcatg aaaaacatgg tagtgaccag aaaagattgt 
4921 tcagtataac ttattaaaat atgataaact ctatgataaa caaagaggtt agagggaata 
4981 tgtttagtaa acacatgata aggataaata gaaatctgaa tgaacaacca cacgtattta 
5041 ttagaagaaa atattgacta caaaatactt atgttatata tctttaagag acttaataac 
5101 gatgttaaaa gaaaaattta cgatatttta gttacttttg aaataatata tgtaatggaa 
5161 caagcattta cggaactaat tgtatcacat gaaatagata tattagatct atatgaacta 
5221 gatttatatt cagactaatg atcacatatc aagttaataa aatcttttta taaatgaaac 
5281 atagcacttt tcattatata agattctata atacccagaa atggaaactt cttaaaaccc 
5341 tcagtaacat aaatagaaaa atagattgtg ttaaatacaa tattaaaacc tgcaaagata 
5401 ttattagata taagctatga aaaataaaca tttatattaa gaagcaaatc atgaaaaagc 
5461 aaaaaatata gaagaatatc aaaaaactaa agatattaaa tacttagaat acacacttga 
5521 aaatataaaa gaaatttcaa taaaaaagat cctatgaaaa ataagatgaa ctcaaacaac 
5581 cacccccaaa ttggacatac aaggatgact ttcttagtgt gtatacttat ccatttaccn 
5641 ttggcccagt ggagccatcn tccactccaa tagaccccgt gatcctcaac ttctacaaac 
5701 gatattgtat atggtaaaat ctaaaggtcc aattttttgt cactatggca cttcataagc 
5761 acacttctat aggctctaga ttatggtcga gattaacacc tgaggggtta tcgacgccca 
5821 gcctcaagga agtgcaggtt gattaagtta tgattaaaac gtgggaggct ccaaggcatg 
5881 caactagaga tgacaaagtc tactaggcta gttcagaccc gaatcagggc attaaatggt 
5941 tttaccatcc ntatatcttt tcaataatgc atttgtacta tgttgggatt ccntctggta 
6001 tataaaaggg gatcnrtgtc atttttgtaa aggatctgat gctcaatatt gaatacatag 

L8 ANSWER 136 OF-146 GENBANK.RTM. COPYRIGHT 2001 

LOCUS (LOC): NTA413172 GenBank (R) 

GenBank ACC. NO. (GBN): AJ413172 

CAS REGISTRY NO. (RN): 359759-08-3 

SEQUENCE LENGTH (SQL): 6581 

MOLECULE TYPE (CI): DNA; linear 

DIVISION CODE (CI): Plants, fungi, algae 

DATE (DATE): 3 Nov 2001 

DEFINITION (DEF): Nicotiana tabacum pararetro virus- like 

sequence, V6, 
KEYWORDS (ST): 0RF2; 0RF3; 0RF4 
SOURCE: common tobacco. 

ORGANISM (ORGN): Nicotiana tabacum 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; 

Tracheophyta; Spermatophyta; Magnoliophyta; 

eudicotyledons; core eudicots; Asteridae; euasterids I; 

Solanales; Solanaceae; Nicotiana 
NUCLEIC ACID COUNT (NA): 2893 a 897 c 1032 g 1759 t 
REFERENCE: 1 (sites) 

AUTHOR (AU): van der Winden,J. 
JOURNAL (SO) : Unpublished 
REFERENCE: 2 (bases 1 to 6581) 

AUTHOR (AU): van der Winden,J. 
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TITLE (TI): Direct Submission 

JOURNAL (SO): Submitted (25-SEP-2001) van der Winden L, Plant 
Molecular Genetics, Institute of Molecular Biology, 
Billrothstrasse 11, 5020 Salzburg, AUSTRIA 

FEATURES (FEAT): 
Feature Key Location Qualifier 



source 1 . .658 1 /organism="Nicotiana tabacum" 

/db-xref="taxon:4097" 
mRNA <1 . . 1 1 57 /note="ORF2" 
CDS <l..l 157 /note="ORF2; putative movement 

protein" 

/codon-start=3 

' /product="hypothetical protein" 
/protein-id="CAC88194.1" 
/db-xref="GI: 15799222" 
/translation="RLDLSIDKILEVPTIKNWFK 
RQKEEYYVVSQREHIIDCKYIKGK 
AQIPILNKRLLNKEIQDIKAKNPIKYVHLGGTEI 
LIKACFREGIDTPIEIYLADDRIV 
QPIEKSIVSAVRGNLIYQKFKFIVSANYSVAVDD 
KNIDKSLVLYWKMSGIELAPGSKI 
FTARCKNLYVLTTKHKITAKNKINKIKIENPFER 
IVSVIDNNDYSYTEIDMDEDLEIV 
KERLSTSKRINNEMPETSSRSTSRSTSKRINYTT 
PQKLIEQKIEEINPHHYYITGIMD 
QRKYLILINTGQEENYVIRELIPEQEIVTIEQQN 
SELPKALRKNEETTEKELIIGGIP 
ILINFKIYQGDKNITLGIKWLKKVKPYKLEDRQL 

TISYENKKIIIKRTLr' 
mRNA 1157.,1726 /note-"ORF3" 

CDS 1157..1726 /note-"ORF3" 

_ /codonTrStart=l 

/product=^"putative polyprotein 

(truncated)" 

/protein-id="CAC88195.1" 

/db-xref="GI: 15799223" 

/translation="MPKIYILSKIIVEGYYNRYY 

TPMVDTGAEANMCRHNCLPESKWE 

KLKIPIVVIGFNNEGSMITYKARNIKIQIWDKIL 

TIEEIYSYEFTNKDILLGMPFLDK 

LYPHIITKTHWWFTTPCKQKLGAKRVNNKVRKTT 

PWIKGSEKITQKLENVIQSNHNIE 

IIIFSINKIKPLQDKLELLYNDNLLQGWE" 

mRNA 302 1 . .3323 /note-"ORF4" 

CDS 3021. .3323 /note="ORF4; putative 

translocation transactivator 
(truncated)" 
/codon-start=l 

/product="hypothetical protein'* 

/protein-id="CAC88196.1" 

/db-xref^"GI: 15799224" 

/translation="MLLQTTYQDKGTQTDESQES 

KDIFTILTTLSLQMESMGKRLQQL 

ESQQHDYKNAELSRSEDSKLPEVEGDVGKLQKPI 



13 



TQLLYIQLQVQANKLAKSHISM" 
repeat-unit 4281. .4469 /rpt-type=TANDEM 

SEQUENCE (SEQ): 

1 gtcgactcga tctaagtata gacaaaatat tagaagtacc aacaataaag aattggttta 
61 aaagacaaaa agaagaatac tatgtagtaa gccaaagaga acatataata gattgtaaat 
121 acatcaaagg taaagcacaa ataccaattt taaataaaag actactaaat aaagaaatac 
181 aagatataaa agcaaaaaat ccaataaaat atgtacactt aggaggaacg gagatcctaa 
241 taaaagcatg ttttagggaa ggaatagata cccctataga aatatacttg gcagatgata 
301 ggattgtaca acctatagaa aagagtatag taagtgctgt aagaggtaac ttaatatacc 
361 aaaaatttaa atttatagta agtgctaact attcagtagc agtagatgat aaaaatatag 
421 ataaatcatt agtattatac tggaaaatgt ctggaataga attagcacca ggaagtaaaa 
481 tattcacagc aagatgtaaa aatctatatg tcttaacaac aaaacataag ataacagcta 
541 aaaataaaat aaataaaata aaaatagaaa atccattcga aagaatagta tcagttatag 
601 acaacaatga ttacagttat acagaaatag acatggatga agatttagaa atagtaaaag 
661 aaagattaag cacatcaaaa cgaataaata atgaaatgcc agaaacatca tcaagaagta 
721 catcaagaag tacatcaaaa agaataaatt ataccactcc acaaaaatta atagaacaaa 
781 aaatagaaga aataaaccca catcattatt atataacagg aataatggac caaagaaaat 
841 atttaatact aataaataca gggcaggaag agaattatgt tataagagaa ctaataccag 
901 aacaagagat agtaacaata gaacaacaaa attcagaact accaaaagcg ttaagaaaaa 
961 acgaagaaac aactgaaaaa gaattaatta ttggaggaat accaatatta ataaatttta 
1021 aaatatatca aggagataaa aatattacac tagggataaa atggttaaaa aaagtcaaac 
1081 catataaatt agaagatagg caattaacaa taagttatga aaataagaaa ataataataa 
1141 aaagaacttt gatataatgc caaagatata catactttcc aaaataatag tagaaggata 
1201 ttataataga tattatacac ctatggtgga tacaggagca gaagctaata tgtgtagaca 
1261 taattgttta ccagaaagta aatgggaaaa gctaaaaatc cccatagtag taataggatt 
1321 taataatgaa ggaagtatga taacatataa agcaagaaat ataaaaatac aaatatggga 
1381 taaaatatta accatagaag aaatatatag ttacgaattc acaaataaag atatattatt 
1441 aggaatgcca tttttagata aattataccc acatattata acaaaaacac attggtggtt 
1501 tactaccccg tgtaaacaaa aattaggagc aaaaagagta aataataaag taagaaaaac 
1561 aacaccctgg attaaaggaa gtgaaaagat tacccaaaaa ttagaaaatg taatacaaag 
1621 taaccataat atagagataa tcattttctc aataaataag ataaaaccac tacaagataa 
1681 actagaatta ctatataatg ataatctact ccaaggatgg gaataacatc aaacaaaaat 
1741 aaagattgaa ctaatagatg aaaatagcat aataacacaa aaacctttaa aatacaattt 
1801 taatgattta acagaattta aaatgcatat aaaagaatta ttagataata actacataca 
1861 agaaagtaat agtaaacata ctaagccagc atttatagta aataagcata gtgaacaaaa 
1921 aagaggaaaa agccgtatgg ttatagatta tagaaactta aatgcaaaaa caaaaacata 
1981 taattatccg ataccaaata aaatactgaa aattagacaa atacaaggat ataactattt 
2041 tagtaaattt gactgtaaat cgggatttta ccatttaaaa ctagaagatg aatctaaaaa 
2101 gttaacagca ttcacagtac cacaaggatt ttacgaatgg aacgtattac cttttggata 
2161 taaaaatgca ccaggtaggt atcaacattt tatggataat tacttcaacc aattagaaaa 
2221 ttgtataata tatatagatg atatattgct atattctaga acagagaacg aacatataaa 
2281 actactagaa aaattcatac acattgtaga aatatcagga ataagtttaa gtaaaaagaa 
2341 agcagaagta atgaaaaatc aaatagaatt tttaggtata caaatagata aaaacggaat 
2401 aaaaatgcaa acccatgtag tacaaaaaat aattaacttg aatgaaacac ttgatacaaa 
2461 aaagaagtta caatcatttt taggattggt taaccaagta agagaatata ttcctaaatt 
2521 agcagaaaac ttaaaaccat tacagaaaaa attaaaaaag gacatagaat atcattttga 
2581 cgaaaaagat aaaatacata tacagaagat aaaaaatatg tgtaaaaaat taccaaaact 
2641 atattttcca gatgaaaaga aacaatttac atatattgta gaaactgatt ctagtgatca 
2701 cagctacgga ggagttctaa aatataaata tgataatgaa aaaattgaac accattgtag 
2761 atattattcc ggatcataca cggaaccaca attaaaatgg gaaataaata ggaaagaact 
2821 atttggatta tataaatgtt tgttagcatt tgagccatat attgtttata acaaatttat 
2881 agtaagaaca gataatacac aagtaaaatg gtggataact cggaaagtac aggattcagt 
2941 aactacaaag gaaataagga gacttgtatt aaatatacaa aattttacat ttacaattga 
3001 agtaatacga actgacaaga atgttattgc agactaccta tcaagacaaa ggtacccaaa 
3061 cagatgaaag ccaagaatca aaagacatat ttacaatcct tactactcta tccttacaga 
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3121 tggagagtat gggaaaaaga ttacaacagc tagaaagtca gcagcatgac tataaaaatg 
3181 cggagctaag tcgatcggaa gactctaaac ttccagaggt agaaggagac gttgggaaac 
3241 tccaaaaacc cataacacag ttgctttata tacagctgca ggtacaagca aacaagttag 
3301 caaaaagcca catatcaatg taaacctaaa taccgtattt gataagccat ttacatcaaa 
3361 aaagccaaga gaagcaatag ttatagcccc acaaacttca acctatgcta atagcctaca 
3421 ccacaacaaa aaggtatata accatattac tcaaacatat attgagaata tatataaaat 
3481 ccagacattt ctgaacctaa accctagatc aacaactact acagatccaa cacaagatta 
3541 tgtaacccaa aaactacagg gatataatag gcttatagcc cagccaaaaa caaaagcaaa 
3601 cttagtaaaa acatgttaca actacggact acttagcaca gtatataccc atgacggaga 
3661 agaaataatt ggaataccag agctatacaa agcatttgtc accttcaaga gaattacaaa 
3721 aggcaaacta ttcttcatca aattctatac agcaccagcg gaaatactat atgatgaaat 
3781 aaaacccata atacaggtga taaagatagg actaacacga gatatgataa taccagaaga 
3841 gatagagaaa caaccggaga tacagaaaat ggagatacca agtttctatg ccaataaaag 
3901 aataattggt atagcaacta ttattcaaga actagctaac aattatctac aaggaaatgc 
3961 tatctggagc tactattcca gagaccaatt gatgatatat gccaactcga aggaactacg 
4021 acaaggagat atggatgaag tccagaaatg gattttatca ttattaaagc cagaaatgca 
4081 accaactacc agagcattga aaaaagaatt tatttcaaac gagttattaa caagatactg 
4141 caaactagta ggacacaaat atccagacca catatgttca aagtgcaacg gagatgataa 
4201 ctatgtacca gaagtccaac tagaatgaag gtatcaacaa gaagacaaaa gaagaagaca 
4261 gctactggaa acatataatg taaagtaaat agtactagtc gcattcatga acagtaaagg 
4321 tcgttcatga atagtaagag tcataaagta aatagtatat gtcataatca tgaacagtaa 
4381 aggtcgttca tgaatagtaa gagtcataaa gtaaatagta tatgtcataa tcatgaacag 
4441 taaaggtcgt tcatgaatag taggagtcat ttgtaaacag gaagagtcgt tttaattttc 
4501 tttatataga atgtaaatct gaggaaggag gacatcctca gacatcctca cccttacctt 
4561 ctctctctaa tattctctct tatcttctct cttgtaagaa atatctgagt taattacaag 
4621 tatctaaaac agctatggag cattcaaacg agttacaaaa ccaggtaagt ttattcataa 
4681 tcatgtttaa ctaaactcat atattcctta taatctcttg aatatcataa ttgtttatca 
4741 tgttatagta ctgttataaa gaacatcttg agaaagtttg cctgtctaat aatgctgaga 
4801 gtagttaagc ccagactatg ccatcctaaa gtggaaacca gtaggcaagg aagccgttta 
4861 ggggagtata cagagttgag gcgctgttag ggtaaatgat tgaagcatga aaaacatggt 
4921 agtgaccaga aaagattgtt cagtataact tattaaaata tgataaactc tatgataaac 
4981 aaagaggtta gagggaatat gtttagtaaa cacatgataa ggataaatag aaatctgaat 
5041 gaacaaccac acgtatttat tagaagaaaa tattgactac aaaatactta tgttatatat 
5101 ctttaagaga cttaataacg atgttaaaag aaaaatttac gatattttag ttacttttga 
5161 aataatatat gtaatggaac aagcatttac ggaactaatt gtatcacatg aaatagatat 
5221 attagatcta tatgaactag atttatattc agactaatga tcacatatca agttaataaa 
5281 atctttttat aaatgaaaca taacactttt cattatataa gattctataa tacccagaaa 
5341 tggaaacttc ttaaaaacct cagtaacata aatagaaaaa tagattgtgt taaatacaat 
5401 attaaaacct gcaaagatat tattagatat aataaattac gtaataaagc tttactaact 
5461 atagaaaatg aaattgaatt ttatgaagat aaacttgaaa gttatcaaca cagaaaagaa 
5521 ataacactga gaaatataga aaccatggat atatgtatca acaggtatac atctcagtat 
5581 taatatgaac aaatcaaaaa ttgatagtta caaatatttt aagtactggt tagaaattct 
5641 aaaacatata aatatgtcaa aaataatatt agaaaaaaat atagaagaat atcaaaaaac 
5701 taaagatatt aaatacttag aatacacact tgaaaatata aaagaaattt caaggttaat 
5761 ttcgctagct acagataatc atgaaaaagc atttaattcc acaactacat aaaaaagatc 
5821 ctatgaaaaa taagatgaac tcaaacaacc ccattctggt ggggatctaa tgatatatgg 
5881 attggtgaag gtaagaagga gcggatggct ttgtatctaa gatcacaagg tataggcttg 
5941 gacaattatt ggtttgtata ttggtatgta tcatacatct acaagttttg gtatagaagt 
6001 tgcagtcagg ataaatatag cactcatacg tattctggaa tttctaattg catgtgtaaa 
6061 ctgggatcta atagcagtac aggtgctagt atcttgtttt atttttctag gtattgtcat 
6121 ggcaccatta agtggtattg gtatgtttac atgcccatcc tagagctgtc aaggtagtgc 
6181 atggctatgg tttgaaatgc aggtattggc atattgcttc tggaggcagc tgttgtcatt 
6241 ttctccaact ctagacgagt ttaaattgat gtcaagagtt tcttgggcaa tttgtcagtc 
6301 attcgttgat ccagggaaga agaaatagtc atgccattcc atagattttg cacaattgtg 
6361 ctactttaat attgggacac ttccaaaagt tgatggtatt actaggagat tcacatatgc 
6421 cttgtctttt ataatgcaaa acatggcgca tggtgagaga tttagagctg ctacattgag 
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6481 gtcttgccca tggcttgtgg atacacatac caccatatgt ttggagttgt ctatgatatc 
654 1 atttgtaaca gttagctcta ttacatagtt agttcaagct t 

L8 ANSWER 137 OF 146 GENBANK.RTM. COPYRIGHT 2001 

LOCUS (LOC): AF190123 GenBank (R) 

GenBank ACC. NO. (GBN): AF190123 

CAS REGISTRY NO. (RN): 248236-12-6 

SEQUENCE LENGTH (SQL): 7767 

MOLECULE TYPE (CI): DNA; linear 

DIVISION CODE (CI): Viruses 

DATE (DATE): 23 Jun 2000 

DEFESflTION (DEF): Tobacco vein-clearing virus, complete genome. 
SOURCE: tobacco vein-clearing virus. 

ORGANISM (ORGN): tobacco vein-clearing virus 

Viruses; Retroid viruses 
NUCLEIC ACID COUNT (NA): 3729 a 959 c 1150g 1929 t 
REFERENCE: 1 (bases 1 to 7767) 

AUTHOR (AU): Lockhart,B.E.; Menke,J.; Dahal,G.; 01szewski,N.E. 
TITLE (TI): Characterization and genomic analysis of tobacco vein 
clearing virus, a plant pararetro virus that 
is transmitted vertically and related to sequences 
integrated in the host genome 
JOURNAL (SO): J. Gen. Virol, 81 Pt 6, 1579-1585 (2000) 
OTHER SOURCE (OS): CA 133:306131 
REFERENCE: 2 (bases 1 to 7767) 

AUTHOR (AU): Lockhart,B.E.; Menke,J.; Dahal,G.; 01szewski,N. 
TITLE (TI): Direct Submission 

JOURNAL (SO): Submitted (25-SEP-1999) Plant Biology, University of 
Minnesota, 220 Biological Sciences Center, St. Paul, MN 
55108, USA 

FEATURES (FEAT): 
Feature Key Location Qualifier 
==^======+======-====^ 

source 1..7767 /organism=" tobacco vein-clearing 

virus" 

/specific-host="Nicotiana 
edwardsonii" 
/db-xref="taxon: 107324" 
CDS 416..2155 /note="ORF 1" 

/codon-start=l 

/product="putative coat protein" 
/protein-id="AAF08287.1 " 
/db-xref="GI:6425073" 

/translation="MNKEEFAIDEKTYENQDGLK 

IKIIFSNLGRRYKKIGDQIYLMIE 

KETARLEDSLTAMTRIIKENEEIDKKREIEIIRN 

QANKEIQQLEETKNTKIIALEKEL 

NMLKLLYENKQREKDKEIELTDEINKFKQKLQPE 

ENLRIEEIDNNIDDQKSELSEISE 

TYTEILENIEKTKNLEINTGDVNMDEKPSTSGIK 

NPKELNPSYYTVSYEQSDRNNTLW 

NSRLNKKWTPKPIHEQYNFLDLDCVEDINKTIQL 

WIGYISKQLIDNKIGITETPGYIE 

RTLIGTVKLWLHNLTKESIDTLRSNKKFNGDMAT 

TSMEILYKYEIAIRNEFSSMTTET 
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EEQHKEKQINRNLMTKLAICNMCYIDEYTCAFRE 
YYYKGTYNTEEGREIRKLYFTKLP 
EPFSSKIIKDWNEAGLTDTLGARIKFLQQWFVQL 
CEKYKEEIKMEKILIKNLKCCKNK 
TAPQFGCTDKYNKKWKTKKYKKHRSKYKYKKPRK 
RYYVKNYQAKKPFRPKKKLTECTC 
YNCGKLGHIAKDCKAPKNPKKKQITEIIIDDEEY 
MQMEYIDYELDSEDSIYEISDSDD 
EIDSEITEEEDEI" 
CDS 2155.3414 /note="ORF 2" 

/codon-start=l 

/product="putative cell-to-cell 
movement protein" 
/protein-id="AAF08288.1" 
/db-xref="GI:6425074" 
/translation="MTEKEIKVVAREEYQNEESS 
EQKIIFDNNIFEQIKGKELDLSVE 
KIFEVPTIKNWFKRQKEEYYVVSQKEHIIDCKYT 
GGKAKIPIINKRIINKEIQDIKAK 
NPDCYVHLGGTEILIKACFREGIDTPIEIYLADD 
RIIQPIEKSIISAIKGNLIYQKFK 
FIISANYSVALNDRNIDKSLVLYWKMSGIELAPG 
SKVFTARCKNLYVLTTKHKIAAKN 
KINKIKIENPFERIVTVIDNNEYSYKEIDMEEDL 
EIVKERLSTSSIPKQLTHETPTSS 
RISTSKREYIIPKNLARNTIEEIKPYHYYITGIM 
EQRKYQILINTGQEENYIIRELVA 
ESEIITTEQLCPELPKELIINEEITEKEIIIGGI 
PLIIQFKIYQGNENITLGIKWLER 
LKPYNLEDKQLTITYKNKKVIIKRTEE" 
CDS 341 1.. 53 18 /note-"ORF 3; contains aspartic 

protease, reverse transcriptase 
and RNAse H" 
/codon-start=l 
/product="polyprotein" 
/protein-id="AAF08289. 1 " 
/db-xref="GI:6425075" 
/translation="MKIYILAKIIVEGYYNRYYT 
PMIDTGAEANICKYNCLPTDKWEK 
LKTPMVVTGFNNEGSMINYKARNVKIQIWDKILT 
lEEFWFEFTTKDMLLGMPFLEKL 
YPHIITKTHWWFTTPCKNKVGAKRVNNKQRKTTE 
WIRGSEKITQKLENINKNTTTQLE 
IIIFTIDKVKIIQKKLEKLYNDNPLQGWEKHKTK 
VKIELIEENSIITQKPLKYNFNDL 
TEFKIHIKDLLDNKYIQESNSKHTSPAFIVNKHS 
EQKRGKSRMVIDYRNLNAKTKTYN 
YPIPNKILKIRQIQGYNYFSKFDCKSGFYHLKLE 
DESKKLTAFTVPQGFYEWNVLPFG 
YKNAPGRYQHFMDNYFNQLENCIVYIDDILLYSR 
TQDEHIKLLEKFAHIIENSGISLS 
KTKAEIMKNQIEFLGIQIDKNGIKMQTHIVQKII 
NLDENIDTKKKLQSFLGIVNQVRE 
YIPKLAENLKPLQKKLKKDVEYSFDEKDKEQIKK 
IKILCKKLPKLYFPDENKKFTYIV 
ETDSSNYSYGGVLKYRYNKEKIEHHCRYYSGSYT 
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EPQEKWEINRKELFALYKCLLAFE 
PYIVYTRFIVRTDNTQVKWWITRKVQDSVTTKEI 
RRLVLNILNFTFTIEIINTNKNVV 
ADYLSRQSYPN" 
CDS 5275..6528 /note="ORF4" 

/codon-start=l 

/product="putative transactivator 
factor" 

/protein-id-"AAF08290,l" 

/db-xref^"GI:6425076" 

/translation="MLLQIIYQDKATQTDPDNTM 

ENLLLAMTTLCKKVESMDEEIQIM 

RKTASSQQHDSKNAELRQSEVSKIPELEGDVEKH 

LKTHNGLLNTVAGSSTTSSSSARK 

KEDIKPRYTNINMNNLFSKPFVQRNTQDTQKEIF 

LPPQVNTYKESLNQTKKTYNHITR 

TYIDNIHKIQNFLNKNPRSQSTQNPNEDYITHHL 

TGYNKLIALPNTSAKLVATCYNYG 

LLDTVYTQTGQEIANIPELYKAFMQYKRITKGTL 

FYVRFYSATAEILYEEIKPIIQVI 

KIGLTREMIIPEKIEEQEEIEKVNIPEFYANKRI 

IGISTILNELANNYLNQNAIWSYY 

SREQTMIYSNCRDIREADMEELRQWVLSLLKPEQ 

PTTTRAIRRNFISPDLLTRYCKLI 

SHKYPDHLCSKCKGEDNIVPDVQLE" 



SEQUENCE (SEQ): 

1 tggtatcaga gccagtacac aaaaagaaac acatcaagtt aataaagaag accaaataag 
61 aaacagtgta agaaaaacaa acaagaaact aagaaagtta tactctgaat ataaaaaatt 
121 acacataaca aagatagata ataataggtt agaacaacta atagatatca tatctaatac 
181 agaatataaa tatatccagt atttgaaaaa acagtatgaa caaggaagaa tataaacaac 
241 aactttttga aaaaattaca gaaaaaagac aagaattacg acatagacaa gataggctta 
301 agaataatat atttgctgga acttgccaga aatctttact agttcaaaga aaaattatat 
361 taaatttaca aatacaaata caacagtttg aatatatact ccaacataac ctataatgaa 
421 taaagaagaa tttgcgatag atgaaaagac atatgaaaac caagatggat tgaaaataaa 
481 gataatattt tcaaatctag gtagaagata taaaaaaata ggagaccaaa tatacctaat 
541 gatagaaaaa gaaacagcaa gattagaaga tagcctgaca gccatgacaa gaataattaa 
601 ggaaaatgaa gaaattgata aaaagaggga aattgaaatt atcagaaatc aagcaaataa 
661 ggaaatacaa caattggaag aaaccaagaa tacaaaaatt atagcattag aaaaagaatt 
721 aaatatgtta aaattactat atgaaaataa acaaagagag aaagataaag aaatagaatt 
781 aacggatgag ataaataaat tcaaacaaaa attacaacca gaagaaaacc tcagaattga 
841 agaaatagat aataatatag atgaccaaaa gtcagagcta agcgaaataa gtgaaacata 
901 tacagaaatt ttagaaaata tagaaaaaac aaaaaattta gaaataaata caggggatgt 
961 aaatatggat gaaaaaccta gcacatcagg tataaagaat ccaaaagaat taaacccaag 
1021 ttattataca gttagttatg aacaatctga tagaaataat acactatgga atagtagact 
1081 aaataaaaaa tggacaccaa aaccaataca tgagcaatat aactttttag atttagattg 
1 141 tgtagaggat ataaataaaa caatccaact atggatagga tatatatcta aacaattaat 
1201 agataataaa ataggaataa cagaaacacc aggatatata gaaaggacac ttatagggac 
1261 agtaaaatta tggttacata atttaacaaa agaaagtata gatactttaa gaagtaataa 
1321 aaaatttaat ggtgacatgg caacaacgag tatggaaata ttatacaaat atgaaatagc 
1381 cataagaaac gaatttagta gtatgactac agaaacagaa gaacaacata aggaaaaaca 
1441 aataaataga aacctaatga caaaattagc aatatgtaat atgtgctata tagatgaata 
1501 tacttgtgca tttagagaat attattataa aggaacatat aatacagaag aaggcagaga 
1561 aataagaaaa ttatatttca caaaattacc agaacctttt agttcaaaaa taataaagga 
1621 ttggaatgaa gcaggattaa cagatacttt aggagctagg ataaaatttt tacaacaatg 
1681 gtttgtacaa ttatgtgaga aatataaaga agaaataaaa atggaaaaaa tattaataaa 
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1741 aaacttaaaa tgttgtaaaa ataaaacagc accacaattt ggatgtacag ataaatataa 
1801 caaaaaatgg aaaacaaaga aatataaaaa acataggtca aagtataaat ataaaaaacc 
1861 aagaaaaaga tattatgtaa aaaattatca ggctaaaaaa ccatttagac caaaaaagaa 
1921 actaacagaa tgtacttgtt ataactgtgg aaaattagga catatagcta aagactgcaa 
1981 agcacctaaa aacccaaaga aaaaacaaat aactgaaatt ataattgatg atgaagaata 
2041 tatgcagatg gaatatatag attatgaact agatagtgaa gacagtatat atgaaatatc 
2101 agattcagat gatgagatag atagtgaaat aactgaagaa gaagatgaaa tctaatgact 
2161 gaaaaagaaa taaaagtagt agcacgagaa gaatatcaaa atgaagaatc ctcagaacaa 
2221 aaaattatat ttgataataa tatatttgaa caaataaaag gaaaagaatt agaccttagt 
2281 gtcgaaaaaa tatttgaagt acctacgata aaaaattggt tcaaaagaca aaaagaagaa 
2341 tactacgtag ttagccaaaa agaacatata attgattgca aatatacagg aggtaaggcc 
2401 aaaataccaa taataaacaa aagaataata aataaagaaa tacaagatat aaaggcaaaa 
2461 aacccaatta agtatgtaca cttaggagga actgaaatat taataaaagc atgttttaga 
2521 gaaggaatag atacccctat agaaatatat ctagcagatg atagaataat ccaacctata 
2581 gaaaaaagca taataagtgc aataaaagga aatcttatat atcaaaagtt taaatttata 
2641 ataagcgcca actattcagt agcactaaat gatagaaata tagataaatc attagtatta 
2701 tactggaaga tgtcaggaat agagttagca cccggaagta aagtatttac agccagatgt 
2761 aagaacctat acgtattaac aacaaaacat aaaatagcag cgaaaaataa aattaataaa 
2821 atcaaaatag aaaacccgtt tgaaagaata gttacagtta ttgacaataa tgaatatagc 
2881 tataaagaaa ttgacatgga agaagattta gaaatagtaa aagaaagatt aagtacatca 
2941 agtataccaa aacaattaac tcatgaaaca ccaacatcat caagaataag tacatcgaaa 
3001 agagaatata taatcccaaa aaatttagcc agaaacacaa tagaagaaat aaagccatat 
3061 cattattata taactggaat tatggaacaa cgaaaatatc aaatattaat aaacacagga 
3121 caggaagaaa attatatcat aagagaatta gtagcagaaa gtgaaataat aactactgaa 
3181 caactatgcc ctgaattacc taaagaatta ataataaatg aagaaataac agaaaaagaa 
3241 ataattattg gaggaatacc tctaataata caatttaaaa tatatcaagg aaacgaaaat 
3301 attacactag gaataaaatg gttagaaaga ttaaaaccat ataatttaga agataagcaa 
3361 ttaactataa catataagaa taaaaaagta ataataaaaa gaacagaaga atgaaaatat 
3421 acatacttgc aaaaatcata gtagaaggat attataatag atactataca ccaatgatag 
3481 atacaggagc agaagctaat atatgtaaat ataattgttt accaacagat aaatgggaaa 
3541 aattaaaaac acctatggta gtaacaggat ttaataatga aggtagtafg attaactaca 
3601 aagctagaaa cgtaaaaata caaatatggg ataaaatatt aactatagaa gaaatatata 
3661 actttgagtt taccacaaaa gatatgttat taggaatgcc attcttagaa aagttatacc 
3721 cacatataat aacaaaaaca cactggtggt ttaccacacc atgtaaaaat aaagtaggag 
3781 ctaaaagagt aaataataaa caacgaaaaa ctacagaatg gatacgaggc agtgaaaaga 
3841 tcacacaaaa attagaaaat ataaataaaa acacaactac ccaattagaa attatcatat 
3901 ttacaataga taaagtaaag ataatccaga aaaaattaga aaaattatat aatgataacc 
3961 ctctacaagg atgggagaaa cataaaacaa aagtaaaaat agaactaatt gaagagaata 
4021 gtataataac acaaaaacct ctaaaatata actttaatga tttaacagaa tttaaaatac 
4081 atataaaaga tttattggat aataaatata tacaagaaag taatagcaaa catacaagtc 
4141 cagcatttat agtaaataaa catagtgaac aaaaaagagg aaaaagcagg atggtcatag 
4201 actatagaaa tttaaatgca aaaactaaaa catataatta tccaataccc aataaaatat 
4261 taaagataag acaaatacaa ggatataatt actttagtaa atttgactgt aagtcaggat 
4321 tttatcattt gaaactagag gatgaatcta aaaaattaac agcttttaca gtaccacaag 
4381 gattttatga atggaatgtt ttaccatttg gatataaaaa tgcaccagga agataccaac 
4441 attttatgga caactatttt aatcaactag aaaattgtat agtatatata gatgatatac 
4501 tactatattc aagaacacaa gatgaacata taaaattatt agaaaaattc gcacatatta 
4561 tagaaaactc tggtataagc ttaagcaaaa ccaaagcaga aattatgaaa aatcagatag 
4621 aatttttagg aatacaaata gataaaaatg gaataaaaat gcaaactcat atagtacaaa 
4681 aaataatcaa tcttgatgaa aacattgata caaaaaagaa attacaatca tttctaggaa 
4741 tagtaaacca agtaagggaa tatataccta aattagcaga aaacttaaaa cctttacaga 
4801 aaaaactaaa aaaggatgta gaatatagtt ttgatgaaaa agataaggaa caaataaaga 
4861 aaattaagat attgtgtaaa aaattaccaa aattatactt tccagatgaa aacaaaaaat 
4921 ttacctatat tgtagaaaca gattcaagta attatagcta tggaggagtt cttaagtata 
4981 gatataataa agaaaaaata gaacatcatt gcagatacta ttcaggatct tatacagaac 
5041 cacaagaaaa atgggaaata aatagaaaag aattatttgc attatataaa tgtctattag 
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5101 catttgaacc atatatagtt tacaccagat ttattgtaag aacagacaat acacaagtaa 
5161 aatggtggat aactagaaaa gtacaagatt cagtcacaac aaaagaaata cggagactgg 
5221 tattaaatat attaaacttt acatttacaa ttgaaataat taatactaac aagaatgttg 
5281 ttgcagatta tttatcaaga caaagctacc caaactgatc cagataatac aatggaaaat 
5341 ctactcttgg ccatgactac actttgtaaa aaagtggaaa gtatggatga agagatacaa 
5401 ataatgagaa aaacagctag cagtcagcag catgactcaa aaaatgcgga gctaagacaa 
5461 tcggaagtct ctaaaattcc agagctagaa ggtgacgttg agaaacacct aaaaactcat 
5521 aacggtttgt taaatacagt tgcaggaagc agcacaacta gcagttcatc tgcaaggaaa 
5581 aaggaagata ttaaacctag atacactaat ataaatatga acaatttatt ttcaaaacca 
5641 ttcgtacaga gaaataccca agatacccag aaagaaatat tcttaccacc acaggtaaat 
5701 acctacaaag aaagtttaaa ccaaaccaaa aagacatata accatataac ccgtacctat 
5761 atagacaata tacataaaat acagaatttt ttaaataaaa atccaagatc ccaatctacc 
5821 caaaatccaa acgaagacta tattacccac catctaacag gatacaataa attgatagct 
5881 ttaccaaata caagtgcaaa attagtagcc acctgctaca actacggact actagatact 
5941 gtatatacac agacaggaca agagatagcc aatatacctg agttatacaa ggcattcatg 
6001 caatataaga ggataactaa agggacatta ttctacgtac gattctattc ggctacagca 
6061 gagatactat atgaagagat aaagcctatc atacaagtta tcaaaattgg actcacaagg 
6121 gaaatgatca taccagaaaa gatagaagaa caagaagaga tagaaaaagt aaatattcca 
6181 gaattctatg ctaacaaaag gataattgga atatctacta tactaaatga actagcaaac 
6241 aactatctaa atcagaatgc aatttggagc tattattcaa gggaacagac aatgatatat 
6301 tcaaattgca gagatataag agaagcagat atggaagaac taagacagtg ggtcttaagt 
6361 cttctaaaac cagaacaacc tacaaccaca agagctataa ggaggaattt catctcccca 
6421 gatctgttaa caagatactg caagctaatt agtcacaaat atccagacca cctatgctca 
6481 aaatgcaaag gagaagataa cartgttcca gacgtacaac tggaataatt cagatagaag 
6541 aaaacgacaa aacaacaaga tatgacagaa agatctactt ttatagattc ctttttcttt 
6601 tgtaaagaaa tgttattttg ttttttataa aagtcgtaaa gaatagttta ctttattaaa 
6661 gtaagagtca ttcatgaaca gtaaaggtcg ttcatgaata gtaagagtcg tttgtaaata 
6721 gtaagagtcg tttgtaaata gtaagagtcg tttgtaaata gtaagagtcg ttagtaagag 
6781 tcgtttgtga atagtaagag tcaatttttg tattataaat agcagttcaa atgtgaataa 
6841 aaaacaggct gcagttttca agcatccaac aattcctctc tcttctctct aatatatttt 
6901 gcagatataa gcatacgaaa gctatggaac aacaaggaga tagcagaaat ctacaaaaac 
6961 aggtatatca ttttccaact atgaacagct aaattactat attcctgata atttctatgt 
7021 aaattataat tgttcatcat actatagtac tgttataaaa atcatcttga gaaagtttgc 
7081 catgataata tgctaatagt aggtaaaccc agactatgcc atcctaatgt tgaaaccggt 
7141 aggcagagaa gccgtttagg ggagtaaacg aagttgaagg cgctgttagg gtaattgatc 
7201 gaagcagaaa atcatggtag tgaccagaaa agatgactaa aataacttat tataatatga 
7261 tgaataagta taatagacat agagattaga aggaatatgt ttagcaaatc atatgaaaaa 
7321 tgaatactta cttattagat gatgatacta gttataagtt actcatacta tatatcctaa 
7381 aaaatatcga aaatgataca agaagaataa tctatgaaaa actagttact tttgaaatgt 
7441 tagaaataat aaatcaaaat tggacagaat taattatacc agatgatata aatatagact 
7501 tatatgagtt agacttatat tcagaataat aatgaatcat atataccaac aatattggca 
7561 acaaactact gaaagcataa ctttactagg aaatttagta aaaacaaata tagaagagta 
7621 tcaaagaacc caagatatac aatatataga atacatattg gaaagcttga aagaaatact 
7681 tagattaatt ccgctacaaa aagaatatta tgaaaaagca tttaatttaa ataattaaat 
7741 gacaaaagat aagaccaccc cccaatt 

L8 ANSWER 138 OF 146 GENBANK.RTM. COPYRIGHT 2001 

LOCUS (LOC): NTA238747 GenBank (R) 

GenBank ACC. NO. (GBN): AJ238747 

CAS REGISTRY NO. (RN): 225689-87-2 

SEQUENCE LENGTH (SQL): 7981 

MOLECULE TYPE (CI): DNA; linear 

DIVISION CODE (CI): Plants, fungi, algae 

DATE (DATE): 11 Nov 1999 

DEFINITION (DEF): Nicotiana tabacum pararetrovirus-like 
sequence, ORFl, 0RF2, 0RF3 and 0RF4. 
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KEYWORDS (ST): coat protein; inclusion body protein; movement protein; 

polyprotein; translation transactivator 
SOURCE: common tobacco. 

ORGANISM (ORGN): Nicotiana tabacum 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; 

Tracheophyta; Spermatophyta; Magnohophyta; 

eudicotyledons; core eudicots; Asteridae; euastends I; 

Solanales; Solanaceae; Nicotiana 
NUCLEIC ACID COUNT (NA): 3814 a 1023 c 1189 g 1955 t 
RFFERENCE* 1 (bases 1 to 7981) 

AUTHOR (AU): JakowitschJ.; Mette,M.F.; van Der Winden,J.; 

Matzke,M.A.; Matzke,AJ. 
TITLE (TI): Integrated pararetroviral sequences define a unique 

class of dispersed repetitive DN A in plants 
JOURNAL (SoT Proc. Natl Acad. Sci. U.S.A., 96 (23), 13241-13246 

(1999) 

OTHER SOURCE (OS): CA 132:103712 
REFERENCE: 2 (bases 1 to 7981) 

AUTHOR (AU): Jakowitsch,J. 
TTTT F (TIV Direct Submission 

JOURNAL (SO): Submitted (16-APR-1999) Jakowitsch J., Institute of 
Molecular Biology, Austrian Academy of Sciences, 
Billrothstrasse 1 1, Salzburg, Salzburg, A - 5020, 
AUSTRLV 



FEATURES (FEAT): 
Feature Key Location 



Qualifier 



source 



primer-bind 1..12 



gene 
CDS 



1..7981 /organism="Nicotiana tabacum" 

/cultivar="petit havana SRI" 
/db-xref="taxon:4097" 

/partial 
/gene="tRNA-Met" 
/note="putative minus-strand DNA 

primer" 

1 12 /gene="tRNA-Met" 
267..2063 /note="ORFl" 
/codon-start=l 

/product="putative coat protein" 
/protein-id="CAB42620.1" 
/db-xref="GI:4775494" 
/translation="MNKEEFALEEKTYENPEGLK 

ITIIFSNLGRRYKKIGNNLNLMLE 
KETVKLEDSLTAMVRITKENEEIDRKREIKEIQQ 

QAKEKIQQIEEVKNTKITELEKEL 
EMLKQMYANKLKEKEKRKEEEEELKLTNEIERFK 

LQLEEVQESPSKISINEINDQSDK 
DTELGENYSETSETYTELIEKTEKIKINPEINTG 

DMNEDKPSISGIKNPKQLNPTYYR _ 
VSYDAYDRNKTLWDKRLNKKWAPRQITEQYNFLD 

LDCVADINKTIQLWIGYISKQLID 
NKITITETPGYIERTLIGTVKLWLQNLSSESLDT 

LRSNKKLDGTTTTTVTDILNKYEI 
AIRNEFSSMTTEVEEQNKEKITNRNLMTKLAICN 

MCYIDEYTCAFRDYYYKGTYSPDE 
SKEIRKLYFTKLPEPFSSKIIKSWNEAGLADTLG 
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VRIKFLQNWFIELCEKYKENMKME 
KILVKNLACCKSRIAPQFGCTDKYYKKEGKKKKF 

KSKYSKYKYRKPRRRYYVKNYKHK 
KPYRKKKKLTECTCYNCGKLGHLAKDCKLPKNPK 

KKQITEILIDNDKYTQVEYVDYEL 
SSEDSIYEISENEFSENEINKDIEESDEENYD" 

CDS 2044..3321 /note="ORF2" 

/codon-start=l 

/product="putative movement 
protein" 

/protein-id="CAB42621.1" 

/db-xref="GI:4775495" 
/translation="MKKIMTEKDIQIlQQEEHQD 

EOSSEQKIIFDANIFEQIKGKELD 
LSIDKILEVPTIKNWFKRQKEEYYWSQREHIID 

CKYIKGKAQIPILNKRLLNKEIQD 
IKAKNPIKYVHLGGTEILIKACFREGIDTPIEIY 

LADDRIVQPIEKSIISAVRGNLIY 
QKFKFIVSANYSVAVDDKNIDKSLVLYWKMSGIE 

LAPGSKIFTARCKNLYVLTTKHKI 
TAKNKINKIKIENPFERIVSVIDNNDYSYTEIDM 

DEDLEIVKERLSTSKRINNEMPET 
SSRSTSRSTSKRINYTTPQKLIEQKIEEINPHHY 

YITGIMDQRKYLILINTGQEENYV 
IRELIPEQEIVTIEQQNSELPKALRKNEETTEKE 

LIIGGIPILINFKIYQGDKNITLG 
IKWLEKVKPYKLEDRQLTISYENKKIIIKRTLI 

CDS 3321..5231 /note="ORF3" 

/codon-start=l 

/product="putative polyprotein 
(aspartic proteinase, reverse 
transcriptase, ribonuclease H)" 
/protein-id="CAB42622.1" 

/db-xfef="GI:4775496" 
/translation="MPKIYILSKIIVEGYYNRYY 

TPMVDTGAEANMCRHNCLPESKWE 
KLKTPIWTGFNNEGSMITYKARNIKIQIWDKIL 

TIEEIYSYEFTNKDILLGMPFLDK 
LYPHIITKTHWWFTTPCKQKLGAKRVNNKVRKTT 

PWIKGSEKITQKLENVIQSNHNIE 
IIIFSINKIKPLQDKLELLYNDNPLQGWEKHQTK 

IKIELIDENSIITQKPLKYNFNDL 
TEFKMHIKELLDNNYIQESNSKHTSPAFIVNKHS 

EQKRGKSRMVIDYRNLNAKTKTYN 
YPIPNKILKIRQIQGYNYFSKFDCKSGFYHLKLE 

DESKKLTAFTVPQGFYEWNVLPFG 
YKNAPGRYQHFMDNYFNQLENCIIYroDILLYSR 

TENEHIKLLEKFIHIVEISGISLS 
KKKAEVMKNQIEFLGIQIDKNGIKMQTHVVQKII 

NLNETLDTKKKLQSFLGLVNQVRE 
YIPKLAENLKPLQKKLKKDIEYHFDEKDKIHIQK 

IKNMCKKLPKLYFPDEKKQFTYIV 
ETDSSDHSYGGVLKYKYDNEKIEHHCRYYSGSYT 

EPQLKWEINRKELFGLYKCLLAFE 
PYIVYNKFIVRTDNTQVKWWITRKVQDSVTTKEI 

RRLVLNIQNFTFTIEVIRTDKNVI 



22 



ADYLSRQRYPNR" 
CDS 5 185.. 6393 /note="ORF4" 

/codon-start=l 

/product="putative translation 
transactivator/ inclusion body 
protein" 

/protein-id="CAB42623.1" 

/db-xref^"GI:4775497" 

/translation="MLLQTTYQDKGTQTDESQES 

KDIFTILTTLSLQMESMGKRLQQL 

ESQQHDYKNAELSQSEDSKLPEVEGDVGKLQKTH 

NTVALYTAAGTSKQVSKKPHINVN 
LNTVFDKPFTSKKPREAIVIAPQTSTYANSLHHN 

KKVYNHITQTYIENIYKIQTFLNL 
NPRSTTTTDPTQDYVTQKLQGYNRLIAQPKTKAN 

LVKTCYNYGLLSTX^HDGEEIIG 
IPELYKAFVTFKRITKGNLFFIKFYTAPAEILYD 

EIKPIIQVIKIGLTRDMIIPEEIE 
KQPEIQKMEIPSFYANmiGIATUQELANNYL 

QGNAIWSYYSRDQLMIYANSKELR 
QGDMDEVQKWILSLLKPEMQPTTRALKKEFISNE 

LLTRYCKLVGHKYPDHICSKCNGD 
DNYVPEVQLE" 
repeat-region 6446..6661 /rpt-unit=6446..6521 
/rpt-type=TANDEM 

SEQUENCE (SEQ): 

1 tggtatcaga gctatgacaa ataaacatat actttaagaa acaaaaccat ggaaaggaac 
61 actgacatta atagtacaaa cccgcaaaac agtgaaacaa tcaaaataac taagaaatcg 
121 cagaaaatta ggaagaaact aaagaaacta tatatagaat atgaatactt aagtatcaca 
181 aaaataaaca aagctaggct gtccaaatta atcgatataa tatctaaaac agaatataaa 
241 tatatttgtt atctaaagga taaaagatga ataaagaaga attcgcatta gaagagaaaa 
301 catatgagaa tccagaagga ttaaaaataa caataatatt ttctaactta ggaagaagat 
361 ataaaaaaat aggaaa^^^^ agaaactgta aaactagagg 

421 atagtttaac cgccatggtt agaataacaa aagaaaacga agaaatagat agaaaacgag 
481 agattaaaga aatacaacag caagctaaag aaaaaataca acagatagag gaagtaaaaa 
541 acactaaaat aacagaatta gaaaaagaat tagagatgct aaaacagatg tatgcaaata 
601 aactaaaaga aaaggaaaaa cgtaaagaag aagaagaaga actaaaacta acaaatgaga 
661 tagaaagatt caaattacag ttagaagaag tacaggagag tccatcaaaa ataagtataa 
721 acgaaataaa tgaccaaagt gataaagaca cagaactagg agaaaactat tcagaaacaa 
781 gtgaaacata cacagaactt atagaaaaaa cagaaaagat aaaaataaac ccagaaataa 
841 atacaggaga tatgaacgaa gataaaccaa gcatatcagg aataaaaaat ccaaaacaat 
901 taaacccaac ctattacaga gtaagttatg atgcatatga cagaaacaaa acattatggg 
961 ataaaaggtt aaataagaaa tgggcaccaa gacagataac tgaacaatat aattttttag 
1021 atctagattg tgtagcagat ataaataaaa caatacaatt atggatagga tatatctcaa 
1081 aacaactaat agataataaa ataacaataa cggaaacacc aggatatata gaaagaacat 
1 141 taataggaac tgtaaaatta tggttacaaa atttatcaag tgaaagctta gacacattaa 
1201 ggagtaataa aaaacttgac ggtacaacta caacaacagt tacagatata ttaaataaat 
1261 atgaaatagc aataagaaat gaatttagta gtatgacaac agaagtagaa gaacaaaata 
1321 aagaaaaaat tacaaataga aatttaatga caaaattagc aatatgtaat atgtgttata 
1381 tagatgaata tacttgtgca tttagagact attattataa aggaacatat agtccagatg 
1441 aaagtaaaga gataagaaaa ttatatttta caaaattacc agaacccttt agctcaaaaa 
1501 taataaaaag ttggaacgaa gcaggactag cagatacatt aggagtaagg ataaaatttc 
1561 tacaaaactg gtttatagaa ttatgtgaaa aatataaaga aaacatgaaa atggaaaaaa 
1621 tattagtaaa aaatttggca tgttgcaaaa gtagaatagc accccaattt ggctgcacag 
1681 ataaatatta caaaaaagaa ggaaagaaaa agaaatttaa atcaaaatat tcaaagtata 
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1741 aatataggaa accaagaaga agatattatg taaaaaatta taaacataaa aaaccatata 
80 ggaaafagaa aaaactaaca gaatgtactt gctataattg tggaaagcta ggacactt g 
86 ccaaagattg taaattacca aaaaacccaa agaagaaaca aattaccgaa atattgatag 
92 aitgataa atatacacaa gtagaatatg tagattatga attaagcagt gaagacagca 
98 Sagat atcagaaaac gagttctctg aaaatgaaat aaacaaggat atagaagagt 
204 cag Jgaaga aaattatgac tgaaaaagat atacaaatta tacaacaaga agaacaccaa 
210 gaSa^caat cttcagaaca aaaaataata tttgacgcta ataUtttga acaaataaaa 
2 6 fgaaaagaat tggatetaag tatagacaaa atattagaag taccaacaat aaagaattgg 
222 Saaaagac aaaaagaaga atactatgta gtaagccaaa gagaacatat aa agattgt 
228 TaaScatca aaggtaaagc acaaatacca attttaaata aaagactact aaataaagaa 
234 a aagSa aaaagcaL aaatccaata aaatatgtac acttaggagg aacggagatc 
240 a aaaag catgt^ag ggaaggaata gataccccta tagaaatata cttggcaga 
246 gataggattg tacaacctat agaaaagagt ataataagtg ctgtaagagg taactte ta 
252 faccLaaat ttaaatttat agtaagtgct aactattcag tagcagtaga tgataaaaat 
258 atagataaat cattagtatt atactggaaa atgtctggaa tagaattagc accaggaagt 
26^ aSattca cagcfagatg taaaaatcta tatgtcttaa caacaaaaca taagataaca 
2701 gctaaaaata aaataaataa aataaaaata gaaaatccat tcgaaagaat agtatcagtt 
276 ftagacaaca atgattacag ttatacagaa atagacatgg atgaagattt agaaatagta 
282 aaagaaagat mgcacatc aaaacgaata aataatgaaa tgccagaaac atcatcaaga 
288 agtacatcaa gaagtacatc aaaaagaata aattatacca ctccacaaaa attaatagaa 
294 iraaaatag aagaaataaa cccacatcat tattatataa caggaataat ggaccaa ga 
300 aaatatttaa tactaataaa tacagggcag gaagagaatt atgttataag agaactaata 
306 ccagaacaag agatagtaac aatagaacaa caaaattcag aactaccaaa agcgttaaga 
72 aaaScgaal af acaactga aaaagaatta attattggag gaataccaat attaataaat 

3 81 tttaaaafat atcaaggaga taaaaatatt --t^SS^Vr'^tla^^ 
3241 aaaccatata aattagaaga taggcaatta acaataagtt atgaaaa taa gaaaataata 
3301 ataaaaagaa ctttgatata atgccaaaga tatacatact «ccaaaata a 
3361 gatattataa tagatattat acacctatgg tggatacagg agcagaagct aatetgtgta 
342 Scataattg ttfaccagaa agtaaatggg aaaagctaaa aacccccata gtagtaacag 
481 gatttaataa tgaaggaagt atgataacat ataaagcaag a—- at-a^^^^^ 
3541 gggataaaat attaaccata gaagaaatat atagttacga attcacaaat aaagatatat 
3601 tatoggaat gccattttta gataaattat acccacatat tataacaaaa acacattggt 
366 ggtttactac cccgtgtaaa caaaaattag gagcaaaaag agtaaataat aaagtaagaa 
372 aLaacacc aS^^^^^^^^ 

3781 aaagtaacca taatatagag ataatcattt tctcaataaa taagataaaa cca acaag 
3841 ataaactaga attactatat aatgataatc cactccaagg atgggaaaaa catcaaacaa 
3901 aaataaagat tgaactaata gatgaaaata gcataataac acaaaaacc ttaaaataca 
3961 attttaatga tttaacagaa tttaaaatgc atataaaaga attattagat aataactaca 
402 tacaagaaag taatagtaaa catactagcc cagcatttat agtaaataag catagtgaac 
4m aaaaaigagg aaaalgccgt atggttatag attatagaaa cttaaatgca aaaacaa aa 
4141 catataatta tccgatacca aataaaatac taaaaattag acaaatacaa ggatataact 
4201 attttagtaa atttgactgt aaatcaggat tttaccattt aaaactagaa gatgaatc^a 
4261 aaaagttaac agcattcaca gtaccacaag gattttacga atggaacgta ttaccmtg 
4321 gatamaaaa tgcaccaggt aggtatcaac attttatgga taattacttc aaccaattag 
4381 aaaattgtat aatatatata gatgatatat tgctatattc tagaacagag aacgaacata 
4441 taaaactact agaaaaattc atacacattg tagaaatatc aggaataagt ttaagtaaaa 
450 agaaagcaga agtaatgaaa aatcaaatag aatttttagg tatacaaata gataaaaacg 
4561 gaataaaaat gcaaacccat gtagtacaaa aaataattaa cttgaatgaa acacttgata 
462 caaaaaagaa gttacaatca tttttaggat tggttaacca agtaagagaa tatat^:c a 
468 aattagcaga aaacttaaaa ccattacaga aaaaattaaa aaaggacata ga« 
474 ttgacgaaL agataaaata catatacaga agataaaaaa tatgtgtaaa aaattaccaa 
4801 aact^ttt tccagatgaa aagaaacaat ttacatatat tg^agaaact gattctagtg 
4861 atcacaecta cggaggagtt ctaaaatata aatatgataa tgaaaaaatt gaacaccatt 
4 2 ^aga«c?ggala tacacggaac cacaattaaa atgggaaata aataggaaag 
4981 aacmtttgg attatataaa tgtttgttag catttgagcc atatattgtt tataacaaat 
5041 ttatagtaag aacagataat acacaagtaa aatggtggat aactcggaaa gtacaggatt 
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5101 cagtaactac aaaggaaata aggagacttg tattaaatat acaaaatttt acatttacaa 
5161 ttgaagtaat acgaactgac aagaatgtta ttgcagacta cctatcaaga caaaggtacc 
5221 caaacagatg aaagccaaga atcaaaagac atatttacaa tccttactac tctatcctta 
5281 cagatggaga gtatgggaaa aagattacaa cagctagaaa gtcagcagca tgactataaa 
5341 aatgcggagc taagtcaatc ggaagactct aaacttccag aggtagaagg agacgttggg 
5401 aaactccaaa aaacccataa cacagttgct ttatatacag ctgcaggtac aagcaaacaa 
5461 gttagcaaaa agccacatat caatgtaaac ctaaataccg tatttgataa gccatttaca 
5521 tcaaaaaagc caagagaagc aatagttata gccccacaaa cttcaaccta tgctaatagc 
5581 ctacaccaca acaaaaaggt atataaccat attactcaaa catatattga gaatatatat 
5641 aaaatccaga catttctgaa cctaaaccct agatcaacta ctactacaga tccaacacaa 
5701 gattatgtaa cccaaaaact acagggatat aataggcrta tagcccagcc aaaaacaaaa 
5761 gcaaacctag taaaaacatg ttacaactac ggactactta gcacagtata tacccatgac 
5821 ggagaagaaa taattggaat accagagcta tacaaagcat ttgtcacctt caagagaatt 
5881 acaaaaggca acctattctt catcaaattc tatacagcac cagcggaaat actatatgat 
5941 gaaataaaac ccataataca ggtgataaag ataggactaa cacgagatat gataataccg 
6001 gaagagatag agaaacaacc ggagatacag aaaatggaga taccaagttt ctatgccaat 
6061 aaaagaataa ttggtatagc aactattatt caagaactag ctaacaatta tctacaagga 
6121 aatgctatct ggagctacta ttccagagac cagttgatga tatatgccaa ctcaaaggaa 
6181 ctacgacaag gagatatgga tgaagtccag aaatggattt tatcattatt aaagccagaa 
6241 atgcaaccaa ctaccagagc attgaaaaaa gaatttattt caaacgagtt attaacaaga 
6301 tactgcaaac tagtaggaca caaatatcca gaccacatat gttcaaagtg caacggagat 
6361 gataactatg taccagaagt ccaactagaa tgaaggtatc aacaagaaga caaaagaaga 
6421 agacagctac tggaaacata taatgtaaag taaatagtac tagtcacatt catgaacagt 
6481 aaaggtcgtt catgaatagt aagagtcatg taaaaaagtc gtaaagtaaa tagtatatgt 
6541 cataatcatg aacagtaaag gtcgttcatg aatagtaaga gtcatgtaaa aaagtcgtaa 
6601 agtaaatagt acgagtcata atcatgaaca gtaaaggtcg ttcatgaata gtaggagtca 
6661 tttgtaaaca gtaagagtcg ttttaatttt ctttatatag aatgtaaatc tgaggaagga 
6721 ggacatcctc agacatcctc atcctcacct tctctctctt atcttctctc aataaaatat 
6781 ctgattatat acaaacttct gaaagctatg gagcattcaa acgagttaca aaaccaggta 
6841 agtttattta taatcatgtt taactaaact cttatattcc ttataatctc ttgaatatca 
6901 taattgttta tcatgttata gtactgttat aaagaacatc ttgagaaagt ttgcctgtct 
6961 aataatgctg agagtagtta agcccagact atgccatcct aaagtggaaa ccagtaggca 
7021 aggaagccgt ttaggggagt acacagagtt gaggcgctgt tagggtaaat gattgaagca 
7081 tpaaaacat ggtagtgacc agaaaagatt gttcagtata acttattaaa atat^ 
7141 ctctatgata aacaaagagg ttagagggaa tatgtttagt aaacacatga taaggataaa 
7201 tagtaaatct gaatgaacaa ccacacgtat ttattagaag aaaatattga ctacaaaata 
7261 cttatgttat atatctttaa gagacttaat aacgatgtta aaagaaaaat ttacgatatt 
7321 ttagttactt ttgaaataat atatgtaatg gaacaagcat ttacagaact aattgtatca 
7381 catgaaatag atatattaga tctatatgaa ctagatttat attcagacta atgatcacat 
7441 atcaagttaa taaaatcttt ttataaatga aacataacac ttttcattat ataagattct 
7501 ataataccca gaaatggaaa cttcttaaaa acctcagtaa cataaataga aaaatagatt 
7561 gtgttaaata caatattaaa acctgcaaag atattattag atataataga ttacgtaata 
7621 aagctttatt aactatagaa aatgaaattg aattttatga agataaactt gaaagttatc 
7681 aacacagaaa agaaataact ctgagaaata tagaaaccat ggatatatgt atcaacaggt 
7741 atacatctca gtattaatat gaacaaatca aaaattgata gttacaaata ttttaagtac 
7801 tggttagaaa ttctaaaaca tataaatatg tcaaaaataa tattagaaaa aaatatagaa 
7861 gaatatcaaa aaactaaaga tattaaatac ttagaataca cacttgaaaa tataaaagaa 
7921 atttccaaca taaaaaagat cctatgaaaa ataagatgaa ctcaaacaac cacccccaaa 
7981 t 

L8 ANSWER 139 OF 146 GENBANK.RTM. COPYRIGHT 2001 

LOCUS (LOC): U95208 GenBank (R) 

GenBank ACC. NO. (GBN): U95208 

CAS REGISTRY NO. (RN): 197678-22-1 

SEQUENCE LENGTH (SQL): 7206 

MOLECULE TYPE (CI): DNA; linear 
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DIVISION CODE (CI): Viruses 
DATE (DATE): 28 Jun 2001 

DEFINITION (DEF): Petunia vein clearing virus, complete genome. 
SOURCE: petunia vein clearing virus. 

ORGANISM (ORGN): petunia vein clearing virus 
Viruses; Retroid viruses; Caulimo virus 
NUCLEIC ACID COUNT (NA): 2383 a 1470 c 1285 g 2068 t 
COMMENT: 

On Jun 28, 2001 this sequence version replaced gi:2463654. 
REFERENCE: 1 (bases 1 to 7206) 

AUTHOR (AU): Richert-Poggeler,K.R.; Shepherd,R. J . 
TITLE (TI): Petunia vein-clearing virus: a plant 

pararetrovirus with the core sequences for an 
integrase function 
JOURNAL (SO): Virology, 236 (1), 137-146 (1997) 
OTHER SOURCE (OS): CA 127:327178 
REFERENCE: 2 (bases 1 to 7206) 

AUTHOR (AU): Richert-Poeggeler,K.R.; Hohn,T. 
TITLE (TI): Isolation of an infectious full-length clone of petunia 
vein clearing virus (PVCV) from infected Nicotiana 
glutinosa 
JOURNAL (SO): Unpublished 
REFERENCE: 3 (bases 1 to 7206) 

AUTHOR (AU): Richert-Poeggeler,K.R.; Shepherd,R.J. 
TITLE (TI): Direct Submission 

JOURNAL (SO): Submitted (26-MAR-1997) Agronomy, University of 
Kentucky, Lexington, KY 40546-0091, USA 
REFERENCE: 4 (bases 1 to 7206) 

AUTHOR (AU): Richert-Poeggeler,K.R.; Shepherd,R.J. 
TITLE (TI): Direct Submission 

JOURNAL (SO): Submitted (28-JUN-2001) Agronomy, University of 
Kentucky, Lexington, KY 40546-0091, USA 

FEATURES (FEAT): 
Feature Key Location Qualifier 

===^===^====+=========-=====+-========= 

source 1..7206 /organism- 'petunia vein clearing 

virus" 

/specific-host="petunia hybrida 
cv. Himmelsroeschen" 
/db-xref^"taxon:59504" 
/note="isolated from Nicotiana 
glutinosa which can be infected 
with the virus via grafting" 
CDS 5 1 ..6590 /note="contains putative movement 

protein, RNA binding domain, 
protease, reverse transcriptase, 
and RNase H" 
/codon-start=l 

/product="ORF I polyprotein" 

/protein-id=" AAK68664. 1 " 

/db-xref="GI: 14574598" 

/translation="MTSPSDYQSNSSLATTYSNA 

PKLSKALSNKYDYLYEVDILKENQ 

KISDTYLPLLNPYSAFAKRSVTPWSQIRSLVQSK 

PRHVKEYVAASKLDQHPVFATGEE 
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QFVTLHIPEEFASHWKSHQFTHIHFGAVKIALTY 

HGRKGQPWARLALLDTRYLEYQH 

ANLGTAEITLNAGTVFITLFPNFTMSLSDANLST 

ALKIQVQIQGAPLTKDSIQATLHY 
QIAWRVQNHAMDLTLPGGEEALFLKIDAGSGATQ 

CTQVPRQLSKEDLIKILPDSWVTN 
YEKLREPEEPLRSTEVSMSKRHDKSVAISFDHSH 

YKKLRNTHHFMGMISDDVIVLDDP 
ETFSKTLPSLMQTHDWIHHFQLDGRAVSWYKDPF 

DGHCPWDIDCQCYSCLYSEDEEDF 
EDGFPTKYKGIPRPGSIAERKMQEEANLKKLYEE 

KDPFVGSLSRPGKYEYLVRYDAPS 
WAKDPHLTVEPTGWDSDEPffPKQPFTTRNTLPK 

lYMFNPLNYENNFPPLSSFSKDGA 
DHTPKIPKRNVVLPSGAKDPTGDLEATVNWQTEN 

ALAQNRMLTTIDRTLKETVTKVDR 
VTDQSSKNQGLIKVLEQQLQDLNKRICPPGTSLF 

HFFDQQKSEMASLKEQIRLLKEQP 
QKNETDTPSYQSSYQPFHSFSSPYMPSNPPNSPF 

TTFANTPQPQPSLFSQYPIQPKSP 
NTFDLAKLVWEKKDAIAAEKRAKKKLQKDEVKQK 

TSLPPESKRPDPQSSSHLGDQFMI 
SDPALPKVYALNEPSVPSEDTSSQSYISTEESVE 
DTDSFSWSEESTQLSQLSSSSND 
SPENNENTLPQTFMVRPTEPEISEVEDEVDGMTE 

EPIPERRPEITPPKMVGTGFHTFS 
LDDISITKWPERIQDFHTWMLTKQLVEREPFLIL 

SEFTARLSGTLREWWNSVGPDDKN 

RFLTSQDFTWNIRILYSYFCGDQSQNKEELRRQI 

FEMKCLSYDRKKIDRHFQRMIKLF 

YHIGGDISLKQAFISSLPPILSERISALIKERGT 

SVTQMHVGDIRQTAFYVLDDLCSK 

RKFFNQMKKMSRDLEKACTKSDLIIKGDKGCSGY 

CNPSRRRKYKRFRLPSFKERDGRQ 
YRKRRRFFRRSKTSKAMRQKPRSCFTCGKIGHFS 

RNCPQNKKSIKLISEIQKYTGIDI 
EDDLESVFSIEDEPSEDTLFSLEFYEEYAGEQYQ 

ITSYEAPKTENPPLPKIHTIVEIP 
QTEVKIYTSKWDKPISVIAFYDTGAAYSIMDPAI 

LPSEYWIPHFRHFGTADDGILTTT 

VKTKHPITIEFFPGFKYTTKLLGSDIPGKDLLIG 

FDIYRQLNNKLRIGADGIRWKNQF 

KRYTEIPRLFQLTTSNELQQLEDVIKNQLCADSH 

VDFLSKCSHPLWLNQDFFIQLPFK 

KNENINPTKASHSGMNPEHLQLAIKECDELQQFD 

LIEPSDSQWACEAFYVNKRSEQVR 

GKLRLVINYQPLNHFLQDDKFPIPN 

KLTLFSHLSKAKLFSKFDLKSGFWQLGIHPNER 

PKTGFCIPDRHFQWKVMPFGLKTAPSLFQKAMIK 

IFQPILFSALVYIDDILLFSETLE 

DHIKLLNQFISLVKKFGVMLSAKKMILAQNKIQF 

LGMDFADGTFSPAGHISLELQKFP 

DTNLSVKQIQQFLGIVNYIRDFIPEVTEHISPLS 

DMLKKKPPAWGKCQDNAVKQLKQL 

AQQVKSLHIPSEGKKILQTDASDQYWSAVLLEEH 

NGKRKICGFASGKFKVSEQHYHST 
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FKEILAVKNGIKKFNFFLIHTNFLVEMDMRAFPK 

MIRLNPKIVPNSQLLRWAQWFSPY 

QFEVKHLKGKDNILADFLSRPHEFSQRLKNSPKV 

LMFQRRTRSSSTKSKADSSQSTGS 

SYKLSHNLPENPPEVFNLDYPWDTSVFLERRTFY 

ELQVFKKYGGSILRPFGVDPEYPF 

AHIFIPNPTDFSEDLLWMFWYLLNHFHILMKFRC 

SKFSKIDQVNPWMLKFLLWFNNHN 

YWASLFKCMKGIKKYVVIWFYRPVNYYQGKLCAL 

PHSSIVKWNHVSVLNDEDEYSELQ 

RFIFQENKCIPKEIWPGSSGSWNYGNSDHPHGQW 

IRDALREYREMNDYFQDAQDPYPA 

YSKVDLTQEELNTLRITRSYGSSSEDADMVKRSI 

YTVQSNIVKDSPRKRKGKAKSRSS 

TRSEKRRAKNKCKYRSLHGEDWWIELGYSTKPST 

PSWTQDSSSEPCV" 

SEQUENCE (SEQ): 

1 tggtatcaga gccatccatt ttgttgaggg aggaaacatt tctcaccata atgacttctc 
61 cttctgatta tcaatccaat tcctctctgg ctaccactta ttctaatgca cctaaactct 
121 ctaaagcttt gtctaataaa tatgactacc tttatgaagt agatatcctt aaagaaaacc 
181 aaaaaatatc agatacctat cttcctttgt taaatcctta ctctgctttt gcaaagagat 
241 ccgttacccc ttggtctcag atccgttcat tagtccagtc taagcctcgc catgtgaaag 
301 aatatgttgc tgcttccaaa ttagatcagc atcctgtctt tgctacagga gaagaacagt 
361 ttgttactct acatattcca gaagaatttg cttctcactg gaaatcccac caattcactc 
421 atatccactt tggtgctgtt aaaattgccc ttacctacca tggaagaaag ggtcaacctg 
481 ttgttgccag attagcactc ttagatacta gatatcttga ataccaacat gctaatcttg 
541 gaactgctga aatcactctc aatgcaggca cagtcttcat tactctcttc cctaacttta 
601 ccatgtcttt atcagacgcc aatctgtcca cagccttaaa aatccaagtc caaatccaag 
661 gagctcccct tacaaaagac tccatccagg caactttaca ttaccaaatt gcttggcgtg 
721 tccaaaatca tgctatggat ttgactctcc ctggaggtga ggaagccttg ttcctcaaaa 
781 ttgatgccgg aagtggagcc acccaatgta cccaagtccc tagacagttg tcaaaagaag 
841 atctaatcaa aatattgcct gattcttggg taacaaatta tgaaaagtta agggaaccag 
901 aagaacctct ecgttccact gaagtctcaa tgtcaaagag acatgataaa tcagtagcta 
961 tcagttttga tcattcccat tataagaagc tcagaaacac ccatcacttc atgggtatga 
1021 tttcagatga tgtcattgtc ttggatgatc ctgaaacttt ttccaaaact cttccatctc 
1 08 1 tcatgcaaac tcatgattgg atccatcatt tccaacttga tggacgtgct gtatcatggt 
1141 ataaagatcc ttttgatggt cattgtccat gggatattga ttgccaatgt tattcttgtt 
1201 tatacagtga agatgaagag gattttgaag atggatttcc tacaaaatac aaagggattc 
1261 ccagaccagg aagtattgct gaaagaaaga tgcaagaaga agctaatctc aagaaacttt 
1321 atgaggagaa ggatcctttt gtgggatcat tgagcaggcc aggcaaatat gagtatctcg 
1381 ttagatacga tgccccatct tgggccaaag atcctcatct gacagtggag ccaacaggtt 
1441 gggatagtga tgaaccaatt ccacctaagc aaccattcac aaccagaaat accttgccaa 
1501 aaatttacat gttcaaccca ctcaattatg aaaacaactt tccacctttg tcatcatttt 
1561 ctaaagatgg tgcagatcat actccaaaaa ttccaaagag aaatgttgtt ttaccatctg 
1621 gtgcaaaaga cccaactggt gatcttgagg caactgtaaa ttggcaaaca gaaaatgctt 
1681 tggctcagaa taggatgtta acaaccattg ataggaccct caaggagact gtaaccaagg 
1741 ttgatagagt cacagatcag tcctcaaaaa atcaaggtct tatcaaagtt ctcgaacaac 
1801 agctccaaga ccttaacaag aggatatgtc ccccaggaac atccttattt cacttctttg 
1861 atcaacaaaa gagtgaaatg gcatccctta aagaacaaat cagacttctc aaagaacaac 
1921 cacagaaaaa tgaaactgat accccatctt accagtcctc atatcagcct tttcacagtt 
1981 tctcttcccc atatatgccc tcaaatccgc ctaactcacc gttcacaact tttgcaaata 
2041 ctccacaacc acaaccttct cttttttctc aatatccaat acagccaaaa tccccaaata 
2101 cctttgactt agccaaacta gtctgggaga aaaaagatgc aattgctgca gaaaaaaggg 
2161 ccaagaagaa gctacaaaaa gatgaggtaa agcaaaaaac ttctttacct ccagaatcaa 
2221 aaaggccaga tccccagtct agcagtcacc ttggtgatca gttcatgatt tctgatccag 
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2281 ctttaccaaa agtttacgcg ctgaatgaac cttcagttcc ttcggaagat acatcctccc 
2341 aatcatacat cagtactgag gaatcagtag aagacacaga ctctttctct gtagtctcgg 
2401 aagaatcaac ccaactttct caactctcca gttcttctaa tgattcacca gaaaacaatg 
2461 aaaatacact acctcagacc tttatggttc gtccaactga accggaaata tctgaagttg 
2521 aagatgaggt agatggtatg actgaggaac caattcctga aagaagacct gaaatcactc 
2581 cacctaagat ggttggcaca ggctttcaca ctttcagcct agatgatatt tccattacca 
2641 aatggcctga aagaattcag gatttccaca catggatgct cactaaacaa ctggtggaaa 
2701 gagaaccatt cttaatcctg tcagaattca cagcaagatt gtcaggaaca cttcgagaat 
2761 ggtggaattc tgtaggacca gatgacaaaa acaggttctt gacttctcaa gacttcacat 
2821 ggaatattag aatactttat tcttatttct gtggagatca aagtcaaaac aaggaagaac 
2881 tcaggagaca aatcttcgag atgaagtgtt tgtcttatga caggaagaaa atagatcgtc 
2941 atttccaaag aatgatcaag ctcttttatc acattggagg agatatttct ttaaagcaag 
3001 ccttcatatc ttcccttccc cccattttat cagaaaggat ctctgccttg ataaaagaaa 
3061 ggggtacttc agttacacaa atgcatgttg gagatattag gcaaacggct ttttacgtcc 
3121 ttgatgatct ctgtagcaaa agaaaattct tcaaccaaat gaagaagatg tctagagatc 
3181 tggaaaaagc atgcaccaaa tctgacctta ttattaaggg tgataaagga tgttcaggct 
3241 attgtaatcc ttcaagaaga aggaaataca agcgcttcaa acttcctagc ttcaaggaac 
3301 gagatggcag gcaatacagg aaaagaagga ggtttttcag aagaagcaaa acaagcaagg 
3361 caatgaggca aaagcctaga tcttgtttta catgtgggaa aattggtcac ttctccagaa 
3421 actgccccca aaacaagaag tcaataaaat tgatttctga aatccaaaag tatactggaa 
3481 ttgatattga agatgacctg gagtccgttt tctcaattga ggatgaaccc agtgaggata 
3541 cattattctc gctagagttt tatgaagaat atgcaggaga gcaataccag atcacttctt 
3601 atgaagctcc aaaaacagaa aaccccccat tgcctaaaat ccacaccatt gttgaaatac 
3661 cacaaacaga ggtcaagatc tacacctcta aatgggataa gcctatctca gtcattgctt 
3721 tttatgatac aggagcggcc tattcaatta tggatccagc aattcttcca tctgaatatt 
3781 ggattccgca tttcaggcac tttggaacgg ctgatgatgg tattttgaca accacagtca 
3841 aaaccaagca tcccattaca attgaattct tcccaggctt caagtacact acaaaattgt 
3901 tgggatcaga tattcctgga aaagatctgc tcattggatt tgatatctac agacagctta 
3961 ataataagct tcgcattgga gcagatggaa tcagatggaa aaaccagttc aaaaggtata 
4021 ctgagatacc acgacttttc cagttaacaa caagcaacga gcttcaacag cttgaggatg 
4081 ttatcaaaaa tcaactctgt gcagattctc atgttgattt cctcagcaaa tgttctcatc 
4141 cattatggct taatcaggat ttcttcatcc agctcccctt taaaaagaat gaaaacatca 
4201 atcccacaaa ggcaagccat tctggaatga atccagaaca tctccagtta gcaataaagg 
4261 aatgtgatga gcttcaaQaa tttga^^^ tcgaaccctc tgattcacaa tgggcatgtg 
4321 aagcttttta tgtcaacaag aggtcagagc aagtgagagg aaaattaagg ttagtcataa 
4381 attatcaacc attaaaccat ttcctccaag atgacaagtt tcccattcca aataagctca 
4441 cactcttctc tcacttatcc aaagcaaagc tcttttcaaa gtttgatctt aagtcaggat 
4501 tttggcaatt aggaatccat cctaatgaga gaccaaaaac agggttttgc attcctgata 
4561 gacattttca atggaaggtc atgccattcg gtcttaaaac agccccttct ttgttccaaa 
4621 aagccatgat caaaatcttc cagcccatcc ttttttcagc tttggtctac attgatgata 
4681 tccttctttt ttctgaaaca ctggaagatc acataaaatt gcttaaccag ttcataagtt 
4741 tggttaaaaa atttggagtc atgttatcgg caaaaaagat gatccttgct caaaataaaa 
4801 ttcaattctt aggaatggat tttgctgatg gaactttttc acctgctggt cacatctctt 
4861 tggaactcca aaagtttcct gatacaaatc tctcggtaaa gcagatacaa caatttcttg 
4921 ggattgttaa ttatatcaga gattttattc cagaggttac tgagcatatt tcccctctta 
4981 gtgatatgct aaagaagaaa ccaccggcat ggggaaaatg tcaagacaat gcagtcaagc 
5041 aattgaagca actggctcaa caagtcaaat cattacacat cccttctgaa ggaaagaaaa 
5101 tacttcaaac tgatgccagt gatcagtatt ggagcgctgt actattggaa gaacataatg 
5161 gcaaaaggaa aatctgtgga tttgcaagtg gaaaatttaa ggtctctgaa caacattatc 
5221 actccacatt caaagagatc cttgctgtta agaatggaat taaaaagttt aatttcttct 
5281 tgatccatac caactttttg gtcgaaatgg acatgagagc atttccaaag atgatcaggt 
5341 taaatcctaa gattgttcct aattctcagt tgctacggtg ggcacaatgg ttttctccct 
5401 accaatttga ggtaaaacat ttaaagggaa aggacaatat ccttgctgat ttcctatcaa 
5461 ggccacatga attttcacaa aggcttaaga attcacccaa ggttcttatg tttcaaagga 
5521 ggacaagatc aagctcaaca aaaagcaagg ctgactcaag ccaaagtaca ggatcaagtt 
5581 ataaattatc tcataatctt cctgaaaacc ctccggaagt ttttaatcta gattatccat 
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5641 gggatacgtc tgtttttcta gaaagaagaa cattctatga actacaagtt ttcaagaaat 
5701 atggaggatc tattttaagg ccttttggag ttgatccaga gtatccattt gctcacatct 
5761 ttatacctaa tcctacagat ttctctgaag atttattatg gatgttttgg tatctgctca 
5821 atcatttcca catcctcatg aaattcaggt gtagtaaatt cagcaagatt gatcaagtta 
5881 atccatggat gttgaagttc ttattatggt tcaataatca taattattgg gcttcattat 
5941 tcaagtgcat gaagggaata aagaaatatg tagtaatttg gttttacaga ccagtaaatt 
6001 actaccaagg caagctttgt gctttacccc attcaagtat tgtgaagtgg aatcatgtct 
6061 ccgttttaaa tgatgaagat gaatactctg agctacaaag attcattttt caagagaaca 
6121 agtgtattcc gaaagaaata tggccaggat cctcaggcag ttggaattat ggaaattcag 
6181 atcatcctca tggccaatgg attagagatg cacttaggga atatcgagaa atgaacgatt 
6241 actttcaaga tgcacaagat ccatatcctg cttattctaa agttgatctc actcaagaag 
6301 aactaaacac cttacgtatc acgcgatcgt atggtagtag ttctgaagat gcagatatgg 
6361 tgaaaagaag tatttacaca gtccaatcca atatagtcaa ggattctcca agaaaaagaa 
6421 aagggaaagc aaaatcaaga tcatccacca gaagcgaaaa aagaagagct aaaaacaaat 
6481 gcaaatatcg gagtcttcat ggagaagatt ggtggattga attgggatat tctacaaagc 
6541 catcaacccc ctcatggaca caggacagct catcagaacc atgtgtctaa agcatgcgtg 
6601 aagaagataa gagggaatcc acttttcagt agtggtccta tgtgtctaaa gcatgcgtga 
6661 agaagataag agggaatcca cttttcagta gtggcttatt actttaaata aagtttgtct 
6721 agtctacttc tgtctttatt aaaagtaagg aatctcttgc tttttctatt atggcatttt 
6781 gtaataatgc tctgtccact atcgagtctg tattattaag tgtgataagt gtgctgtcat 
6841 gatgtaagtg cttacgtctt gagtcataga gattgcctat ataaggtaac ctcattagtt 
6901 gttttcgatc agaaccaact ttcctttgta aaatattatc atcaataaaa tgtttgtgtg 
6961 aataactctc tcttgtttcc aaactctcaa aatggtagtt cgatcctaat attaaattga 
7021 aggtgattac tctataagag tgattactga agatagtagg tcagattaag gttattgact 
7081 tggtattatg gctgtatggt gccgtgttaa agcccagtag tctaacaaaa catctgatct 
7141 agttatcctc agtttaatca gaattctctt ataggttcaa tcttataatt cctccatatc 
7201 tcgtat 
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